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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Gregor Meyers, Tillmann Rumenapf, 
Heinz-Jurgen Thiel 

(ii) TITLE OF INVENTION: Hog cholera virus vaccine and diagnostic 

(iii) NUMBER OF SEQUENCES: 13 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Organon Teknika Corporation 

Biotechnology Research Institute 

(B) STREET: 1330-A Piccard Drive 

(C) CITY: Rockville 

(D) STATE: Maryland 

(E) COUNTRY: U.S.A. 

(F) ZIP: 20850 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 494,991 

(B) FILING DATE: 16 March 1990 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: William M. Blackstone 

(B) REGISTRATION NUMBER: 29,772 

(C) REFERENCE/DOCKET NUMBER: 



(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: (301) 258-5200 
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(2) INFORMATION FOR SEQ ID NO:l: 

V^rf (i) SEQUENCE CHARACTERISTICS: 

^ w (A) LENGTH: 12284 base pairs 

c? J (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hog cholera virus 

(B) STRAIN: Alfort 
(H) CELL LINE: PK 15 and 38A1D 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 364.. 12060 

(D) OTHER INFORMATION: /label= 435JtDA _j>rotein 

(ix) FEATURE: 

(A) NAME/KEY: primer_bind 

(B) LOCATION: complement (2587.. 2619) 
(D) OTHER INFORMATION: /label= primer_l 

(ix) FEATURE: 

(A) NAME/KEY: primer_bind 

(B) LOCATION: complement (2842.. 2880) 
(D) OTHER INFORMATION: /label= primer_2 

(ix) FEATURE: 

(A) NAME/KEY: variation 

(B) LOCATION: replace (127, "c") 

(ix) FEATURE: 

(A) NAME/KEY: variation 

(B) LOCATION: replace (1522, "g") 

(ix) FEATURE: 

(A) NAME/KEY: variation 

(B) LOCATION: replace (10989, "t") 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GTTAGCTCTT TCTCGTATAC GATATTGGAT ACACTAAATT TCGATTTGGT CTAGGGCACC 60 

CCTCCAGCGA CGGCCGAAAT GGGCTAGCCA TGCCCATAGT AGGACTAGCA AACGGAGGGA 120 

CTAGCCGTAG TGGCGAGCTC CCTGGGTGGT CTAAGTCCTG AGTACAGGAC AGTCGTCAGT 180 

AGTTCGACGT GAGCACTAGC CCACCTCGAG ATGCTACGTG GACGAGGGCA TGCCCAAGAC 240 

ACACCTTAAC CCTGGCGGGG GTCGCTAGGG TGAAATCACA TTATGTGATG GGGGTACGAC 300 



Mm. 




o 
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CTGATAGGGT GCTGCAGAGG CCCACTAGCA GGCTAGTATA AAAATCTCTG CTGTACATGG 360 

CAC ATG GAG TTG AAT CAT TTT GAA TTA TTA TAC AAA ACA AGC AAA CAA 4 08 

■■sfts** liiilj Me G1U Leu Asn His Phe Glu Ijeu Leu Tyr Lys Thr Ser Lys Gin 

AAA CCA GTG GGA GTG GAG GAA CCG GTG TAT GAC ACC GCG GGG AGA CCA 456 
Lys Pro Val Gly Val Glu Glu Pro Val Tyr Asp Thr Ala Gly Arg Pro 

20 25 30 

CTA TTT GGG AAC CCA AGT GAG GTA CAC CCA CAA TCA ACG CTG AAG CTG 504 
Leu Phe Gly Asn Pro Ser Glu Val His Pro Gin Ser Thr Leu Lys Leu 
35 40 45 

CCA CAC GAC AGG GGG AGA GGA GAT ATC AGA ACA ACA CTG AGG GAC CTA 552 
. .... . . , Pro Hls As P Ar 9 G1 Y Ar 9 Gly Asp He Arg Thr Thr Leu Arg Asp Leu 

mm?: 50 55 60 



CCC AGG AAA GGT GAC TGT AGG AGT GGC AAC CAT CTA GGC CCG GTT AGT 
Pro Arg Lys Gly Asp Cys Arg Ser Gly Asn His Leu Gly Pro Val Ser 
65 70 75 



600 



GGG ATA TAC ATA AAG CCC GGC CCT GTC TAC TAT CAG GAC TAC ACG GGC 648 
Gly He Tyr He Lys Pro Gly Pro Val Tyr Tyr Gin Asp Tyr Thr Glv 
80 85 90 95 

CCA GTC TAT CAC AGA GCT CCT TTA GAG TTC TTT GAT GAG GCC CAG TTC 696 
Pro Val Tyr His Arg Ala Pro Leu Glu Phe Phe Asp Glu Ala Gin Phe 
100 105 HO 

TGC GAG GTG ACT AAG AGA ATA GGC AGG GTC ACG GGT AGT GAT GGT AAG 744 
Cys Glu Val Thr Lys Arg He Gly Arg Val Thr Gly Ser Asp Gly Lys 
115 120 125 

CTT TAC CAC ATA TAT GTG TGC GTC GAT GGT TGC ATA CTG CTG AAA TTA 792 
Leu Tyr His He Tyr Val Cys Val Asp Gly Cys He Leu Leu Lys Leu 
130 135 140 

GCC AAA AGG GGC ACA CCC AGA ACC CTA AAG TGG ATT AGG AAC TTC ACC 840 
Ala Lys Arg Gly Thr Pro Arg Thr Leu Lys Trp He Arg Asn Phe Thr 
I 45 150 155 

AAC TGT CCA TTA TGG GTA ACC AGT TGC TCC GAT GAC GGC GCA AGT GGC 888 
Asn. Cys Pro Leu Trp Val Thr Ser Cys Ser Asp Asp Gly Ala Ser Gly 
160 165 170 175 

AGC AAG GAT AAG AAG CCA GAC AGA ATG AAC AAA GGT AAG TTG AAG ATA 936 
Ser Lys Asp Lys Lys Pro Asp Arg Met Asn Lys Gly Lys Leu Lys He 
180 185 l!o 

GCC CCA AGA GAG CAT GAG AAG GAC AGC AAG ACC AAG CCT CCT GAT GCA 984 
Ala Pro Arg Glu His Glu Lys Asp Ser Lys Thr Lys Pro Pro Asp Ala 

,V; " ,: • 195 200 205 
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ACG ATT GTA GTA GAG GGA GTA AAA TAC CAA ATC AAA AAG AAA GGC AAA 1032 
Thr lie Val Val Glu Gly Val Lys Tyr Gin lie Lys Lys Lys Gly Lys 
210 215 220 

GTC AAA GGG AAG AAC ACA CAA GAC GGC CTG TAC CAT AAT AAG AAC AAG 1080 
Val Lys Gly Lys Asn Thr Gin Asp Gly Leu Tyr His Asn Lys Asn Lys 
225 230 235 



CCA CCA GAG TCC AGG AAG AAA CTA GAA AAA GCC CTG TTG GCT TGG GCG 
Pro Pro Glu Ser Arg Lys Lys Leu Glu Lys Ala Leu Leu Ala Trp Ala 
240 245 250 255 



1128 



Hi 



GTG ATA ACA ATC TTG CTG TAC CAG CCT GTA GCA GCC GAG AAC ATA ACT 1176 
Val lie Thr lie Leu Leu Tyr Gin Pro Val Ala Ala Glu Asn lie Thr 
260 265 270 

CAA TGG AAC CTG AGT GAC AAC GGC ACT AAT GGT ATT CAG CGA GCC ATG 1224 
Gin Trp Asn Leu Ser Asp Asn Gly Thr Asn Gly lie Gin Arg Ala Met 
275 280 285 

TAT CTT AGA GGG GTT AAC AGG AGC TTA CAT GGG ATC TGG CCC GAG AAA 1272 
Tyr Leu Arg Gly Val Asn Arg Ser Leu His Gly He Trp Pro Glu Lys 
290 295 300 

ATA TGC AAG GGG GTC CCC ACT CAT CTG GCC ACT GAC ACG GAA CTG AAA 1320 
He Cys Lys Gly Val Pro Thr His Leu Ala Thr Asp Thr Glu Leu Lys 
305 310 315 

GAG ATA CGC GGG ATG ATG GAT GCC AGC GAG AGG ACA AAC TAT ACG TGC 1368 
Glu He Arg Gly Met Met Asp Ala Ser Glu Arg Thr Asn Tyr Thr Cys 
3 20 325 330 335 

TGT AGG TTA CAA AGA CAT GAA TGG AAC AAA CAT GGA TGG TGT AAC TGG 1416 
Cys Arg Leu Gin Arg His Glu Trp Asn Lys His Gly Trp Cys Asn Trp 
340 345 350 

TAC AAC ATA GAC CCT TGG ATT CAG TTA ATG AAC AGG ACC CAA ACA AAT 1464 
Tyr Asn He Asp Pro Trp He Gin Leu Met Asn Arg Thr Gin Thr Asn 

355 360 365 

TTG ACA GAA GGC CCT CCA GAT AAG GAG TGT GCC GTG ACC TGC AGG TAT 1512 
Leu Thr Glu Gly Pro Pro Asp Lys Glu Cys Ala Val Thr Cys Arg Tyr 
370 375 380 

GAC AAA AAT ACC GAT GTC AAC GTG GTC ACC CAG GCC AGG AAT AGG CCA 1560 
Asp Lys Asn Thr Asp Val Asn Val Val Thr Gin Ala Arg Asn Arg Pro 
385 390 395 

ACT ACT CTG ACT GGC TGC AAG AAA GGG AAA AAC TTT TCA TTC GCA GGC 1608 
Thr Thr Leu Thr Gly Cys Lys Lys Gly Lys Asn Phe Ser Phe Ala Gly 
400 405 410 415 

ACA GTC ATA GAG GGC CCG TGC AAT TTC AAC GTT TCC GTG GAG GAC ATC 1656 
Thr Val He Glu Gly Pro Cys Asn Phe Asn Val Ser Val Glu Asp He 
420 425 430 
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Su Glv ^ ^ T AGT CTG CTC CAG GAC ACG GG * CTG 

Leu Tyr Gly Asp His Glu Cys Gly Ser Leu Leu Gin Asp Thr Ala Leu 

4Ji) 440 445 

TAC CTA TTG GAT GGA ATG ACC AAC ACT ATA GAG AAT GCC AGG CAA GGT 
Tyr Leu Leu Asp Gly Met Thr Asn Thr lie Glu Asn Ala Arg Gin gJv 
° 455 460 

GCG GCG CGG GTG ACA TCT TGG CTT GGG AGG CAG CTC AGT ACC GCA rrr 
Ala Ala Arg Val Thr Ser Trp Leu Gly Arg Gin Su Ser J£ 22 Sy 



AAG AAG 
Lys Lys 
480 

TCA CCT 
Ser Pro 



AAC AAC 
Asn Asn 



CTA GAG AGG AGA AGC AAA ACC TGG TTT GGT GCC TAT GCC CTG 
Leu Glu Arg Arg Ser Lys Thr Trp Phe Gly Ala Jjr S£ SS 
485 490 495 

TAC TGC AAT GTG ACT AGA AAA ATA GGG TAC ATA TGG TAT ACA 
Tyr cys Asn Val Thr Arg Lys lie Gly Tyr He Trp -Eyr i£ 
500 505 5 io 

Cys i£ ££ A?? IT CTC CCT *** W ACA *** ATA A ™ GGC 
Cys Thr Pro Ala Cys Leu Pro Lys Asn Thr Lys He He Gly 

515 520 525 

Pro S?5 fi 6 ? AC ACC *** GCG GAA GAC GGG ^ A TC CTT CAT GAA 

Pro Gly Lys Phe Asp Thr Asn Ala Glu Asp Gly Lys He Lei Ss Glu 

535 540 

A S 2 GC CAC CTA TCA GAA TTT TTG TTG CTT TCT CTA GTT ATC CTG 

Met Gly Gly Hxs Leu Ser Glu Phe Leu Leu Leu Ser Su vTl He Su 

550 ^ 555 

TCT GAC TTT GCC CCC GAG ACA GCT AGC ACG CTA TAC 
Ser Asp Phe Ala Pro Glu Thr Ala Ser Su <£r 

565 570 



E£ Afa S CC Sf 6 TCC CAC GAA GAA CCT ^ GGT 

Tyr Ala He Pro Gin Ser His Glu Glu Pro Glu Glv 

580 585 

CAA CTT AAC CTA ACA GTG AAA CTT AGG ACA GAA GAC 
Gin Leu Asn Leu Thr Val Lys Leu Arg Thr Glu Asp 
595 600 

TCA GTT TGG AAT ATT GGC AAA TAT GTT TGT GTT AGA 
Ser val Trp Asn He Gly Lys Tyr Val Cys Val aS 
610 615 



CTA ATT TTA CAC 
Leu He Leu His 
575 

TGT GAT ACG AAC 
Cys Asp Thr Asn 
590 

GTA GTG CCA TCA 
Val Val Pro Ser 
605 

CCA GAC TGG TGG 
Pro Asp Trp Trp 
620 



£o Jyr ?£ J£ S? GC I CTG TTT GAA GAG GGA G GA CAG GTT 

iyr Glu Thr Lys Val Ala Leu Leu Phe Glu Glu Ala Gly Gin Val 

630 635 

S E £ g S S 5 g S « « «- g 



1704 



1752 



1800 



1848 



1896 



1944 



1992 



2040 



2088 



2136 



2184 



2232 



2280 



2328 
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35 



AGC GCA TCA ACT ACT GCG TTT CTC ATT TGC TTG ATA AAA GTA TTG AGA 
Ser Ala Ser Thr Thr Ala Phe Leu He Cys Leu He Lys Val Leu Arg 
660 665 670 

GGA CAG GTT GTG CAA GGT ATA ATA TGG CTG CTG CTG GTG ACC GGG GCA 
Gly Gin Val Val Gin Gly He He Trp Leu Leu Leu Val Thr Gly Ala 
675 680 685 

CAA GGG CGG CTA GCC TGT AAG GAA GAC TAC AGG TAT GCG ATC TCG TCA 
Gin Gly Arg Leu Ala Cys Lys Glu Asp Tyr Arg Tyr Ala He Ser Ser 
690 695 700 

ACC AAT GAG ATA GGG CTG CTG GGC GCT GAA GGT CTC ACC ACT ACC TGG 
Thr Asn Glu He Gly Leu Leu Gly Ala Glu Gly Leu Thr Thr Thr Trn 
705 710 715 * 

AAA GAA TAC AGC CAC GGT TTG CAG CTG GAC GAC GGA ACC GTT AAG GCC 
Lys Glu Tyr Ser His Gly Leu Gin Leu Asp Asp Gly Thr Val Lys Ala 
720 725 730 * 735 

GTC TGC ACT GCA GGG TCC TTT AAA GTC ACA GCA CTT AAC GTG GTT AGT 
Val Cys Thr Ala Gly Ser Phe Lys Val Thr Ala Leu Asn Val Val Ser 
740 745 750 

AGG AGG TAT CTA GCA TCA TTG CAC AAG AGG GCT CTA CCC ACC TCA GTG 
Arg Arg Tyr Leu Ala Ser Leu His Lys Arg Ala Leu Pro Thr Ser Val 
755 760 765 

ACA TTT GAG CTC CTA TTT GAC GGG ACC AAC CCA GCA ATC GAG GAG ATG 
Thr Phe Glu Leu Leu Phe Asp Gly Thr Asn Pro Ala He Glu Glu Met 
7? 0 775 780 

GAT GAT GAC TTC GGA TTT GGG CTG TGC CCA TTT GAC ACG AGT CCT GTG 
Asp Asp Asp Phe Gly Phe Gly Leu Cys Pro Phe Asp Thr Ser Pro Val 
7«5 790 7g5 

ATC AAA GGG AAG TAC AAC ACC ACT TTG TTA AAC GGC AGT GCT TTC TAT 
He Lys Gly Lys Tyr Asn Thr Thr Leu Leu Asn Gly Ser Ala Phe Tyr 
800 805 810 815 

CTA GTC TGC CCA ATA GGA TGG ACT GGT GTC GTA GAG TGC ACA GCA GTG 
Leu Val Cys Pro He Gly Trp Thr Gly Val Val Glu Cys Thr Ala Val 
820 825 830 

AGC CCC ACA ACC TTG AGA ACA GAA GTG GTG AAA ACC TTC AGG AGA GAT 
Ser Pro Thr Thr Leu Arg Thr Glu Val Val Lys Thr Phe Arg Arg Asp 
835 8 40 845 

£1 SS ? GA GTA GAC TGT GTG ACC ACC ATA GT * GAA AAA 

Lys Pro Phe Pro His Arg Val Asp Cys Val Thr Thr He Val Glu Lvs 

850 855 860 

GAA GAC CTA TTC CAT TGC AAG TTG GGG GGT AAT TGG ACA TGT GTA AAA 
Glu Asp Leu Phe His Cys Lys Leu Gly Gly Asn Trp Thr Cys Val Lys 
865 870 875 



2376 



2424 



2472 



2520 



2568 



2616 



2664 



2712 



2760 



2808 



2856 



2904 



2952 



3000 
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III fill! 



GGC GAC CCA GTG ACT TAT AAG GGG GGG CAA GTA AAG CAG TGC AGG TGG 3048 
Gly Asp Pro Val Thr Tyr Lys Gly Gly Gin Val Lys Gin Cys Arg Trp 
880 885 ago * 89 % 

TGT GGT TTC GAG TTT AAA GAG CCC TAC GGG'CTC CCA CAC TAC CCT ATA 3096 
Cys Gly Phe Glu Phe Lys Glu Pro Tyr Gly Leu Pro His Tyr Pro He 
900 905 910 

GGC AAG TGC ATC CTA ACA AAT GAG ACA GGT TAC AGG GTA GTA GAT TCC 3144 
Gly Lys Cys He Leu Thr Asn Glu Thr Gly Tyr Arg Val Val Asp Ser 
915 920 925 

ACA GAC TGC AAC AGA GAT GGC GTC GTT ATT AGC ACT GAA GGG GAA CAT 3192 
Thr Asp Cys Asn Arg Asp Gly Val Val He Ser Thr Glu Gly Glu His 
930 935 940 

GAG TGC TTG ATT GGC AAC ACT ACC GTC AAG GTG CAT GCA CTG GAT GAA 3240 
Glu Cys Leu He Gly Asn Thr Thr Val Lys Val His Ala Leu Asp Glu 
945 950 955 

AGA TTG GGC CCT ATG CCG TGC AGA CCC AAA GAA ATC GTC TCT AGT GAG 3288 
Arg Leu Gly Pro Met Pro Cys Arg Pro Lys Glu He Val Ser Ser Glu 
960 965 970 975 

GGA CCT GTG AGG AAA ACT TCT TGT ACA TTC AAC TAC ACA AAG ACT CTA 3336 
Gly Pro Val Arg Lys Thr Ser Cys Thr Phe Asn Tyr Thr Lys Thr Leu 
980 985 990 

AGA AAC AAA TAC TAT GAG CCC AGA GAC AGT TAC TTC CAG CAA TAT YTG 3384 
Arg Asn Lys Tyr Tyr Glu Pro Arg Asp Ser Tyr Phe Gin Gin Tyr Met 
995 1000 1005 

CTC AAG GGC GAG TAT CAA TAC TGG TTT AAT CTG GAC GTG ACC GAC CAC 3432 
Leu Lys Gly Glu Tyr Gin Tyr Trp Phe Asn Leu Asp Val Thr Asp His 
1010 1015 1020 

CAC ACA GAC TAC TTT GCC GAG TTT GTT GTC TTG GTA GTA GTA GCA CTG 3480 
His Thr Asp Tyr Phe Ala Glu Phe Val Val Leu Val Val Val Ala Leu 
1025 1030 1035 

TTA GGA GGA AGG TAC GTT CTG TGG CTA ATA GTG ACC TAC ATA ATT CTA 3528 
Leu Gly Gly Arg Tyr Val Leu Trp Leu He Val Thr Tyr He He Leu 
1040 1045 1050 1055 

ACA GAG CAG CTC GCT GCT GGT CTA CAG CTA GGC CAG GGT GAG GTG GTA 3576 
Thr Glu Gin Leu Ala Ala Gly Leu Gin Leu Gly Gin Gly Glu Val Val 
1060 1065 1070 

TTG ATA GGG AAC CTA ATT ACC CAC ACG GAC AAT GAG GTG GTG GTG TAC 3624 
Leu He Gly Asn Leu He Thr His Thr Asp Asn Glu Val Val Val Tyr 
1075 1080 1085 

TTC CTA CTG CTC TAC TTA GTA ATA AGA GAT GAG CCC ATA AAG AAA TGG 3672 
Phe Leu Leu Leu Tyr Leu Val He Arg Asp Glu Pro He Lys Lys Trp 
1090 1095 1100 
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A?f S? SI? iS T t GA I 66 CTG GAC GTA GCC GGA TTG TTG CTG CAG 

1 hr Arg Trp Asn 1611 As P Val Ala G1 y ^ Leu Leu Gin 
1265 1270 1275 

TGC GTC CCA ACT CTT TTA ATG GTT TTT ACG ATG TGG GCA GAC ATT CTC 
Cys Val Pro Thr Leu Leu Met Val Phe Thr Met Trp aS Sp He SS 

1285 1290 1295 

J£ He 2n A T A S CT A P TAT GAG OTA ACA TTA TAG TAC 

Thr Leu He Leu He Leu Pro Thr Tyr Glu Leu Thr Lys Leu Tyr Tvr 

1300 1305 i3 10 y 

CTT AAG GAA GTG AAG ATT GGG GCA GAA AGA GGT TGG CTG TGG AAA ACT 
Leu Lys Glu Val Lys lie Gly Ala Glu Arg Gly Trp Leu ™ ™ JS 
1315 1320 1325 



3720 



3768 



ATA CTA CTG CTG TTT CAT GCA ATG ACT AAC AAT CCA GTC AAG ACC ATA 

Phe His Ala Met Thr Asn Asn Pro Val Lys Thr He 
H05 1110 1115 

HI #P ACA GTA GCA TTG CTA ATG ATC AGT GGG GTT GCC AAG GGT GGT AAG ATA 

Thr Val Ala Leu Leu Met He Ser Gly Val Ala Lys Gly Gly Lys He 
1120 1125 H30 H3 5 

GAT GGT GGC TGG CAG AGA CAA CCG GTG ACC AGT TTT GAC ATC CAA CTC 38! 6 
Asp Gly Gly Trp Gin Arg Gin Pro Val Thr Ser S£ Asp He Gin SS 
1140 1145 1150 

GCA CTG GCA GTC GTA GTA GTC GTT GTG ATG TTG CTG GCA AAG AGA GAC 3864 
Ala Leu Ala Val Val Val Val Val Val Met Leu Leu aS Lys Arg Asp 
: ;.; 1155 1160 1165 * 

S CG TTC CCT TTG GTA ATC ACA GTG G CA ACC CTG AGA ACG GCC 3912 

Pro Thr Thr Phe Pro Leu Val He Thr Val Ala Thr Leu Arg Thr A 
1170 1175 1180 

fif A T A A k C G ? T TTT AGC ACA GAT CTA GTC ATA G CC ACA GTG TCG 3960 

Lys lie Thr Asn Gly Phe Ser Thr Asp Leu Val He Ala Thr Val Ser 
H 85 1190 1195 

GCA GCT TTG TTA ACT TGG ACC TAT ATC AGC GAC TAC TAC AAA TAC AAG 

A 2oo Trp Thr Tyr Ile Ser Asp T y r ^ L ^ s T y r L y s 

1205 1210 1215 

ACT TGG CTA CAG TAC CTC GTC AGC ACG GTG ACT GGA ATC TTC CTG ATA 40«5fi 
Thr Trp Leu Gin Tyr Leu Val Ser Thr Val Thr Gly ile p£e He 
1220 1225 1230 

S T ? ^ G f* 6 ^? A ATA GGC GAA TTG GAT 016 CAC G CC CCA ACC TTG 4104 
Arg Val Leu Lys Gly He Gly Glu Leu Asp Leu His Ala Pro Thr Leu 
1235 1240 1245 



4008 



m g£ ™ £ C ? TC JTT TAC ATC CTT GTA TAC CTT ATT TCC ACT 4152 

^ Pro Ser Ar 9 Pro Leu Phe Tyr Ile Leu Val Tyr Leu Ile Ser Thr 

1250 1255 1260 



4200 



4248 



4296 



4344 
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AAC TAT AAG AGG GTA AAC GAC ATC TAC GAG GTC GAC CAA ACT AGC GAA 
Asn Tyr Lys Arg Val Asn Asp He Tyr Glu Val Asp Gin Thr Ser Glu 
1330 1335 1340 

HI < : ' :;i $' GGG GTT TAC CTT TTC CCT TCT AAA CAG AGG ACG AGC GCT ATA ACT ACT 

Gly Val Tyr Leu Phe Pro Ser Lys Gin Arg Thr Ser Ala He Thr Ser 
1345 1350 1355 



1395 i4oo 



1405 



GTT TCA AGG CTC GTG GCG GCT TTG ATT GAA GTC AAT TGG GCC TTC GAC 
Val ser Arg Leu Val Ala Ala Leu He Glu Val Asn Trp Ala Phe Asp 
1410 1415 142Q 

AAT GAA GAA GTC AAA GGC TTA AAG AAG TTC TTC TTG CTG TCT AGT AGC 

ASn So C G1U Val LyS Gly Lys L ^ s Phe Phe Leu Leu Ser Ser Arg 

1425 1430 1435 * 

GTC AAA GAG TTG ATC ATC AAA CAC AAA GTG AGG AAT GAA GTA GTG GTC 

YSn yS ^ U 116 116 LYS His Lys Val Ar ^ Asn Glu Val Val Val 

1440 144 5 1450 1455 

CGC TGG TTT GGA GAT GAA GAG ATT TAT GGG ATG CCA AAG CTG ATC GGC 
Arg Trp Phe Gly Asp Glu Glu He Tyr Gly Met Pro Lys Leu He Gly 
14 60 1465 1470 



4392 



4440 



ACC ATG TTG CCA TTA ATC AAA GCC ATA CTC ATT AGC TGC ATC AGC AAC 4488 
Thr Met Leu Pro Leu He Lys Ala He Leu He Ser Cys He Ser Asn 
1360 "65 1370 1375 

AAG TGG CAA CTC ATA TAC TTA CTG TAC TTG ATA TTT GAA GTG TCT TAC 
Lys Trp Gin Leu He Tyr Leu Leu Tyr Leu He Phe Glu Val Ser Tvr 
1380 1385 1390 



4536 



tO;- " ; !E AC ? TC CAC AAG AAA GTT ATA GAT GAA ATA GCT GGT GGG ACC AAC TTC 4584 

^ ^ Hi. Lys Lys Val He Asp Glu He Ala Gly Gly Thr Asn Phe 



4632 



4680 



4728 



4776 



H ? TA AAG GCA GCA ACA CTA AGT A GA AAC AAA CAC TGT ATG TTG TGT 4824 

# Val A ** Ala Thr Leu Ser Arg Asn Lys His Cys Met Leu Cys 

1475 1480 1485 

ACC GTC TGT GAG GAC AGA GAT TGG AGA GGG GAA ACT TGC CCT AAA TGT 
Thr Val Cys Glu Asp Arg Asp Trp Arg Gly Glu Thr Cys Pro Lys Cys 
1490 1495 1500 

GGG CGT TTT GGA CCA CCA GTG GTC TGC GGT ATG ACC CTA GCC GAT TTC 
Gly Arg Phe Gly Pro Pro Val Val Cys Gly Met Thr Leu Ala Asp Phe 
1505 1510 1515 

GAA GAA AAA CAC TAT AAA AGG ATT TTC ATT AGA GAG GAC CAA TCA GGC 
Glu Glu Lys His Tyr Lys Arg He Phe He Arg Glu Asp Gin Ser Gly 
III" 1525 1530 1535 

^ CTT AGG GAG GAG ° AT GCA 6(5(3 TAC TTG CAG TAC AAA GCC AGG 

Gly Pro Leu Arg Glu Glu His Ala Gly Tyr Leu Gin Tyr Lys Ala Arg 
1540 1545 155Q 



4872 



4920 



4968 



5016 
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SJ Sn Ph T ^ J GG CTC CCA GTG TTA GCT ACA AAA GTC AAG 

Gly Gin Leu Phe Leu Arg Asn Leu Pro Val Leu Ala Thr Lys Val Lys 

1555 1560 i5 6 5 



5064 



||| ATG CTC CTG GTT GGT AAC CTC GGG ACA GAG ATT GGG GAT CTC GAA CAC 5112 

* - Met Leu Leu val Gly Asn Leu Gly Thr Glu He Gly Asp Leu GlS Sis 

1570 1575 1580 

Su G?v V«? Sf 3 CCA GCT GIT TGC AAG AAG GTT ACT GAA 

Leu Gly Trp Val Leu Arg Gly Pro Ala Val Cys Lys Lys Val Thr Glu 

1590 1595 

CAC GAA AGA TGC ACC ACG TCT ATA ATG GAT AAG TTG ACT GCT TTC TTT 
1600 r9 ° yS Thr Ser Ile Met As P L * s ^ T «r Ala Se PlSe 

B^l 1615 

^ GA f, TA t T< Z CCA AGG GGC ACT ACT CCC AGA GCT CCC GTA AGA TTC CCT 
Gly Val Met Pro Arg Gly Thr Thr Pro Arg Ala Pro Val Arg Phe p£> 
1620 1625 1630 

ACC TCC CTC CTA AAG ATA AGA AGA GGG CTG GAG ACT GGT TGG GCT TAC 
Thr Ser Leu Leu Lys lie Arg Arg Gly Leu Glu Thr Gly Trp Ala Tyr 
1635 1640 1645 

Thr 2£ r?° A T° AGC TCA GTA GAC CAT GTC ACT TGT GGG AAA 

Thr His Gin Gly Gly He Ser Ser Val Asp His Val Thr Cys Gly Lys 
1650 1655 1660 

■P GAC TTA CTG GTG TGT GAC ACC ATG GGT CGG ACA AGG GTT GTT TGC CAG 

^ ASP Val C * s As P Met Gly Arg Thr Arg Val Val Cys SS 

=J 1670 1675 

„. Si ^ ^ T AAG ATG ACC GAC GAG TCC GAA TAC GGA GTC AAA ACT GAC 

y, ^er Asn Asn Lys Met Thr Asp Glu Ser Glu Tyr Gly Val Lys tS Asp 

= . m 1680 1685 1690. 1695 

IB Ser So ?f ?? A ? CC AGG TGT TAC GTG TTT AAC CCG GAA GCA 

TO Ser G1 y °y s Pro Glu Gly Ala Arg Cys Tyr Val Phe Asn Pro Glu Ala 

1700 1705 1710 

GTT AAC ATA TCA GGC ACT AAA GGA GCC ATG GTC CAC TTA CAG AAA ACG 
Val Ash lie Ser Gly Thr Lys Gly Ala Met Val His Leu" cln J£ ihr 
1715 1720 1725 

r^l r? A S?* ACC TGT GTG ACA GCA TCA GGA ACC CCG GCC TTC TTT 

Gly Gly Glu Phe Thr Cys Val Thr Ala Ser Gly Thr Pro Ala Phe p£e 
1730 1735 1740 

*tP ; a AG S C ™* CTT J* 8 GGC TGG TCA GGG CTA CCG ATA TTT GAA GCA 

ASP ^4. LyS Asn Leu ^ G1 y Tr P Gly Leu Pro Ile Phe Glu Ala 
IMxi • L/ *° 1750 1755 

TCA AGT GGA AGG GTA GTC GGA AGG GTC AAG GTC GGG AAG AAC GAG GAT 
Ser Ser Gly Arg Val Val Gly Arg Val Lys Val Gly L^s A^n £S Asp 

1765 1770 1? ^ 5 



5160 



5208 



5256 



5304 



5352 



5400 



5448 



5496 



5544 



5592 



5640 



5688 
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TCC AAA CCA ACC AAG CTC ATG AGT GGG ATA CAA ACG GTT TCT AAA AGC 
Ser Lys Pro Thr Lys Leu Met Ser Gly He Gin Thr Val Ser Lys Ser 
178 0 1785 1790 

G ° C %2 A GAC TTG ACG GAG ATG GTG AAG ATA ACG ACC ATG AAC AGG 

Ala Thr Asp Leu Thr Glu Met Val Lys Lys He Thr Thr Met Asn Arg 
1795 1800 1805 

GGA GAG TTC AGA CAA ATA ACC CTG GCC ACA GGT GCC GGA AAA ACT ACA 
Gly Glu Phe Arg Gin He Thr Leu Ala Thr Gly Ala Gly Lys Thr Thr 
1810 1815 1820 

GAG CTC CCT AGA TCA GTT ATA GAA GAG ATA GGG AGG CAT AAG AGG GTG 
Glu Leu Pro Arg Ser Val He Glu Glu He Gly Arg His Lys Arg Val 
1825 1830 1835 

TTG GTC TTA ATC CCC TTG AGG GCG GCA GCA GAA TCA GTA TAC CAA TAC 
Leu Val Leu He Pro Leu Arg Ala Ala Ala Glu Ser Val Tyr Gin Tyr 
1840 1845 1850 1855 

ATG AGA CAG AAA CAT CCG AGT ATA GCA TTC AAT CTA AGG ATA GGT GAG 
Met Arg Gin Lys His Pro Ser He Ala Phe Asn Leu Arg He Gly Glu 
18 60 1865 1870 

ATG AAG GAA GGT GAT ATG GCC ACG GGA ATA ACC TAT GCC TCT TAC GGT 
Met Lys Glu Gly Asp Met Ala Thr Gly He Thr Tyr Ala Ser Tyr Gly 
187 5 1880 1885 

TAC TTT TGC CAG ATG TCA CAA CCC AAG CTG AGA GCC GCA ATG GTA GAA 
Tyr Phe Cys Gin Met Ser Gin Pro Lys Leu Arg Ala Ala Met Val Glu 
1890 1895 , 1900 

TAT TCC TTT ATA TTC CTA GAT GAG TAT CAT TGT GCT ACC CCA GAA CAA 

Yr ?«L Phe Ile Phe ^ Asp Glu Tvr His A la Thr Pro Glu Gin 

19 °5 1910 1915 

CTG GCA ATC ATG GGG AAG ATC CAC AGA TTC TCA GAA AAC CTG CGG GTG 
Leu Ala He Met Gly Lys Ile His Arg Phe Ser Glu Asn Leu Arg Val 
1920 1925 1930 1935 

GTA GCT ATG ACA GCG ACA CCG GCA GGC ACA GTA ACA ACC ACT GGG CAG 
Val Ala Met Thr Ala Thr Pro Ala Gly Thr Val Thr Thr Thr Gly Gin 
1940 1945 1950 

AAA CAC CCT ATA GAG GAA TTT ATA GCC CCG GAA GTG ATG AAA GGA GAA 
Lys His Pro Ile Glu Glu Phe Ile Ala Pro Glu Val Met Lys Gly Glu 
1955 1960 1965 

GAC TTG GGT TCT GAG TAC TTA GAT ATT GCC GGA CTG AAG ATA CCA GTA 
Asp Leu Gly Ser Glu Tyr Leu Asp Ile Ala Gly Leu Lys Ile Pro Val 
1970 1975 igso 

? AG , ? AG J* 6 ^ ATG CTA GTT TTT GTG CCC ACC AGG AAC ATG 

LyS Asn Asn Met ^ Val Phe Val Pr <> Thr Arg Asn Met 
1985 1990 1995 



5736 



5784 



5832 



5880 



5928 



5976 



6024 



6072 



6120 



6168 



6216 



6264 



6312 



6360 
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a ^ a a Aia Ly G Leu a a a GGA r c r tcg «<= «« 

2000 2005 Y Y yr n Ser G1 y 

2010 2015 

E E E a EE E 5 E - « - « « s 2 

2020 2 025 2030 

E E E EE E E S a a E S E E S a - 

2040 2045 

a 2 EE 2 E a EE E a 5 E £ E a - 

2055 2060 

£ EE E E EE S E S S E g 2 - 

2070 2075 
AAA AGA ATG GCC GTC ACT ATT rrr r*** ~ 

Lys Arg Met Ala Val ?£ u Sy Glu Sf 6 AGA AGA GGG AGG 6648 

2080 2085 " Arg Arg Ar 9 

2090 2095 

SaT GG * E S EE E E E E E a E E S S 

2105 2110 

?2 5 S SS E S E E S E E E E E 55 

2120 2125 

a a as a a e a a e e e e a s e 



6696 



6744 



£ a a e ee a e e e a a a a s a - 

2185 2190 

e a a e a a a a s a s a s e e a a 6984 

2200 2205 

a a a a a s a as a e a a a a a - 
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AGA AAA TTG GGA GAT GAC GTA CCC CCC TAC GTG TAT GCT ACA GAG GAT 

9 2lt 5 ^ U ASP ASP Pr ° Pr ° ^ Val ^ AS J£ Asp 

2230 2235 

GAG GAC TTG GCA GTG GAA CTG TTG GGC CTA GAT TGG CCG GAC CCA rrA 
Glu Asp Leu Ala Val Glu Leu Leu Gly Leu Asp Jg g£ 2£ g£ 

2245 2250 2255 

AAC CAA GGC ACC GTG GAA GCT GGC AGA GCA CTA AAA CAG GTG GTT rrT 
Asn Gin Gly Thr Val Glu Ala Gly Arg Ala LeS JJJ Sn vlt v7l 
2260 2265 2270 

s2 Thr aS SS i£ ? TA GTC GCC CTG TTC GGC TAC ™ 

bGr Thr Glu Asn Ala Val Ala Leu Phe Gly Tyr Val 

2275 2280 2285 

Jvr Sn Jf? T° TT ^ CA AGA CAT ATA CCA GTG GTC ACA GAT ATA 

Gly Tyr Gin Ala Leu Ser Lys Arg His He Pro Val Val Thr Asp lie 

2290 2295 2300 

Ifr s2 vll G^ HI S- C AGG ^ GAG GAC ACT ACG CAC CTA GAG TAT 
2305 P JJio G1U ASP 23^ ^ 

GCT CCG AAT GCC ATC AAG ACG GAG GGG AAG GAA ACT GAA TTG AAG CAP 
Ala Pro Asn Ala He Lys Thr Glu Gly Lys Glu tS G^ ™ £s SS 

2325 2330 2335 

S» ??! ? AG ^? G GAT GTG AGA TGT GTG GAA GCA GTG ACC AAT TAT 

Leu Ala Gin Gly Asp Val Gin Arg Cys Val Glu Ala Val Thr to £yr 

2340 2345 2350 

GCG AGA GAG GGC ATC CAA TTC ATG AAG TCG CAG GCA CTG AAA GTG AGA 
Ala Arg Glu Gly He Gin Phe Met Lys Ser Gin Ala Su i£ Arg 
2355 2360 2365 

GAA ACC CCT ACC TAT AAA GAG ACA ATG AAC ACC GTG GCA GAT TAT GTC 
Glu Thr Pro Thr Tyr Lys Glu Thr Met Asn Thr Vai a2 Asp Tyr vll 

2375 2380 

AAA AAG TTT ATT GAG GCA CTG ACG GAT AGC AAG GAA GAC ATC ATT AAA 
Lys Lys Phe He Glu Ala Leu Thr Asp Ser Lys G^ Asp He ^ 

2390 2395 

TAT GGG CTG TGG GGG GCA CAT ACG GCA TTG TAT AAG AGC ATT GGT GCC 
Tyr Gly Leu Trp Gly Ala^His Thr Ala Leu Tyr £s Ser 52 SJ S£ 

2405 2410 2415 

AGG CTT GGT CAC GAA ACC GCG TTC GCA ACT CTA GTT GTG AAG TGG TTG 
Arg Leu Gly Hxs Glu Thr Ala Phe Ala Thr Leu Val Va? £y*s ??p 2u 
2420 2425 2430 

s s s § k 2 a s gas s s s: s s s 



7080 



7128 



7176 



7224 



7272 



7320 



7368 



7416 



7464 



7512 



7560 



7608 



7656 



7704 
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GAC TTG GTC GTT TAT TAC ATT ATT AAC AGA CCT CAA TTC CCA GGA GAC 
Asp Leu val Val Tyr Tyr He He Asn Arg Pro Gin Phe Pro Gly Asp 
2450 2455 2460 

ACA GAA ACA CAA CAA GAA GGG AGA AAA TTT GTT GCC AGC CTG CTA GTC 
Thr Glu Thr Gin Gin Glu Gly Arg Lys Phe Val Ala Ser Leu Leu Val 
2465 2470 2475 

TCA GCT CTA GCG ACT TAT ACA TAC AAG AGC TGG AAC TAC AAT AAT CTG 

ofL LeU Ala Thr ^ Thr ^ L * s S ^ r Tr P Asn Asn Asn Leu 
2480 2485 2 490 2495 

TCC AAA ATA GTT GAA CCG GCT TTG GCT ACC CTG CCC TAT GCC GCT AAA 
Ser Lys He Val Glu Pro Ala Leu Ala Thr Leu Pro Tyr Ala Ala Lys 
2500 2505 2510 

GCC CTC AAG CTA TTT GCT CCT ACC CGA CTG GAG AGC GTT GTC ATA CTG 
Ala Leu Lys Leu Phe Ala Pro Thr Arg Leu Glu Ser Val Val He Leu 
2515 2520 2525 

AGC ACT GCA ATC TAC AAA ACA TAC CTA TCA ATA AGG CGA GGC AAA AGT 
Ser Thr Ala He Tyr Lys Thr Tyr Leu Ser He Arg Arg Gly Lys Ser 
2530 2535 2540 

GAT GGT CTG CTA GGT ACA GGG GTT AGC GCG GCC ATG GAA ATT ATG TCA 
Asp Gly Leu Leu Gly Thr Gly Val Ser Ala Ala Met Glu He Met Ser 
2545 2550 2555 

CAA AAC CCA GTA TCT GTG GGT ATA GCA GTT ATG CTA GGG GTA GGG GCT 
Gin Asn Pro Val Ser Val Gly He Ala Val Met Leu Gly Val Gly Ala 
2560 2565 2570 2575 



7752 



7800 



7848 



7896 



7944 



7992 



8040 



8088 



2580 



2595 



2600 



2610 



2615 



2625 



2630 



2640 



2645 



2660 



2665 



GAG 


CAA 


AAA 


AGA 


ACA 


CTA 


Glu 


Gin 


Lys 


Arg 


Thr 


Leu 


5 








2590 


GAC 


CAG 


GCC 


GCC 


ACC 


GAC 


Asp 


Gin 


Ala 


Ala 


Thr 


Asp 








2605 


ATA 


ATG 


GCT 


TTG 


TTC 


GAA 


He 


Met 


Ala 


Leu 


Phe 


Glu 






2620 






TTA 


GTG 


TAC 


CAC 


CTC 


TAT 


Leu 


Val 


Tyr 


His 


Leu 


Tyr 




2635 






GAG 


TTG 


GCC 


CAA 


AGA 


ACA 


Glu 


Leu 


Ala 


Gin 


Arg 


Thr 


2650 






2655 


TTC 


GAG 


GCT 


GTG 


GAA 


CTA 


Phe 


Glu 


Ala 


Val 


Glu 


Leu 










2670 



8136 



8184 



8232 



8280 



8328 



8376 
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CTG GGA GTA GAC AGT GAG GGA AAA ATT CGC CAG CTA TCG AGC AAT TAC 
Leu Gly Val Asp Ser Glu Gly Lys lie Arg Gin Leu Ser Ser Asn Tyr 
2675 2680 2685 

|||! ATA CTA GAG CTC TTG TAT AAG TTC CGC GAC AAT ATC AAG TCT AGT GTG 

lie Leu Glu Leu Leu Tyr Lys Phe Arg Asp Asn He Lys Ser Ser Val 
269 <> 2695 2700 

AGG GAG ATA GCA ATC AGC TGG GCC CCC GCC CCC TTT AGT TGC GAT TGG 
Arg Glu He Ala He Ser Trp Ala Pro Ala Pro Phe Ser Cys Asp Trp 
2705 2710 2715 

ACA CCA ACA GAT GAC AGA ATA GGG CTT CCC CAT GAC AAT TAC CTC CGA 
Thr Pro Thr Asp Asp Arg He Gly Leu Pro His Asp Asn Tyr Leu Arg 
w.Cd 2725 2730 2735 

m ' GTG GAG ACA AAG TGC CCC TGT GGT TAC AGG ATG AAA GCG GTA AAA AAC 

Val Glu Thr Lys Cys Pro Cys Gly Tyr Arg Met Lys Ala Val Lys Asn 
2740 2745 2750 

TGC GCT GGG GAG TTG AGA CTT CTG GAG GAA GGG GGT TCA TTC CTC TGC 
Cys Ala Gly Glu Leu Arg Leu Leu Glu Glu Gly Gly Ser Phe Leu Cys 
2755 2760 2765 

AGA AAT AAA TTC GGT AGA GGC TCA CAA AAC TAC AGG GTG ACA AAA TAC 
Arg Asn Lys Phe Gly Arg Gly Ser Gin Asn Tyr Arg Val Thr Lys Tvr 

mmm 2770 2775 2780 

TAT GAT GAC AAT TTA TCA GAA ATA AAA CCA GTG ATA AGA ATG GAA GGA 
Tyr Asp Asp Asn Leu Ser Glu He Lys Pro Val He Arg Met Glu Gly 
2785 2790 2795 

CAC GTG GAA CTG TAT TAC AAG GGG GCC ACT ATC AAA CTG GAT TTT AAC 
S„ Val Glu Leu Tyr Lys Gly Ala Thr He Lys Leu Asp Phe Asn 

2800 2805 2810 2815 

I® A* 0 AGT AAA ACG GTA CTG GCA ACT GAC AAA TGG GAG GTT GAC CAC TCC 88 <56 

Asn Ser *#• Thr Val Leu Ala Thr Asp Lys Trp Glu Val Asp His sSr 
2820 2825 2830 

ACC CTG GTT AGG GCA CTC AAG AGG TAC ACA GGG GCT GGA TAT CGA GGG 
Thr Leu Val Arg Ala Leu Lys Arg Tyr Thr Gly Ala Gly Tyr Arq Glv 
2835 2840 2845 

GCG TAT TTG GGT GAG AAA CCT AAC CAT AAA CAT CTG ATA CAG AGA GAC 
Ala Tyr Leu Gly Glu Lys Pro Asn His Lys His Leu He Gin Arg Asp 
2850 2855 2860 

M-m l GT GCA A <r G ATT ACC ^ GAC AAG GTC TGC TTC ATC AAA ATG AAG AGA 

Cys Ala Thr He Thr Lys Asp Lys Val Cys Phe lie Lys Met Lys Arg 

sg^j; 2865 2870 2875 

* ■ GGG TGT GCG TTC ACT TAT GAC CTA TCC CTC CAC AAC CTT ACC CGG CTA 

SSn YS Tyr ASP ^ S6r ^ His Asn ^ Th r Arg Leu 

2880 2 885 2890 y 2I95 



8424 



8472 



8520 



8568 



8616 



8664 



8712 



8760 



8808 



8904 



8952 



9000 



9048 



ATC GAA TTG GTA CAC AAG AAT AAC CTG GAA GAT AGA GAA ATC CCT GCT 
lie Glu Leu Val His Lys Asn Asn Leu Glu Asp Arg Glu He Pro Ala 
2900 2905 2910 

f, T< f ££ G G<I T ACA ACC TGG CTG GCC TAC ACA TTT GTG AAT GAA GAC ATA 
Val Thr Val Thr Thr Trp Leu Ala Tyr Thr Phe Val Asn Glu Asp He 
291 5 2920 2925 

GGG ACC ATA CCA ACT TTT GGG GAA AAG GTG ACA CCG GAG AAA CAG 

Gly Thr He Lys Pro Thr Phe Gly Glu Lys Val Thr Pro Glu Lys Gin 
2930 2935 2940 



GAG GAG GTA GTC TTG CAG CCT GCT GTG GTG GTG GAC ACA ACA GAT GTA 

S« 1 1 LeU Gln Pro Ala Val Val Val As P Thr Thr Asp Val 
2945 2 950 2955 

GCC GTG ACC GTG GTA GGG GAA ACC TCT ACT ATG ACT ACA GGG GAG ACC 
Ala Val Thr Val Val Gly Glu Thr Ser Thr Met Thr Thr Gly Glu Thr 
2960 2965 2970 2975 

CCG ACA ACA TTT ACC AGC TTA GGT TCG GAC TCG AAG GTC CGA CAA GTC 
Pro Thr Thr Phe Thr Ser Leu Gly Ser Asp Ser Lys Val Arg Gin Val 
29 80 2985 2990 

CTG AAG CTG GGC GTG GAC GAT GGT CAA TAC CCC GGG CCT AAT CAG CAG 
Leu Lys Leu Gly Val Asp Asp Gly Gin Tyr Pro Gly Pro Asn Gin Gin 
2995 3000 3005 

AGA GCA AGC CTG CTC GAA GCT ATA CAA GGT GTG GAT GAA AGC CCC TCG 
Arg Ala Ser Leu Leu Glu Ala He Gin Gly Val Asp Glu Arg Pro Ser 
3010 3015 3020 

GTA CTG ATA CTG GGG TCT GAT AAG GCC ACC TCC AAT AGG GTC AAG ACC 

™L GlY Ser Asp ^ Ala Thr Ser Asn Arg Val Lys Thr 

3025 3030 3035 

GCA AAG AAT GTG AAG ATA TAT AGG AGC AGG GAC CCC CTG GAA CTG AGA 

YS ASn Val LyS Ile Tyr Ar 9 Ser Ar< ? As P Pro Leu Glu Leu Arg 
3040 3045 3050 3055 

GAA ATG ATG AAA AGG GGA AAA ATC CTA GTC GTA GCC TTG TCT AGA GTC 
Glu Met Met Lys Arg Gly Lys Ile Leu Val Val Ala Leu Ser Arg Val 
3060 3065 3070 

GAT ACC GCT CTG CTG AAA TTC GTT GAT TAC AAA GGC ACC TTC CTG ACC 
Asp Thr Ala Leu Leu Lys Phe Val Asp Tyr Lys Gly Thr Phe Leu Thr 
3075 3080 3085 

AGA ^ G £5 C CTA GAG GCA TTA AGT CTG GGT AAG CCT AAG AAA AGA GAC 

9 Sn^ G1U Ala L6U Ser ^ *<ys l*o Lys Arg Asp 

3090 3095 3100 

ATA ACT AAA GCA GAA GCA CAA TGG CTG CTG CGC CTC GAA GAC CAA ATA 
lie Thr Lys Ala Glu Ala Gin Trp Leu Leu Arg Leu Glu Asp Gin He 

3105 3110 3115 
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GAA GAG CTG CCT GAC TGG TTC GCA GCC AAG GAA CCC ATA TTT CTA GAA 9768 
Glu Glu Leu Pro Asp Trp Phe Ala Ala Lys Glu Pro He Phe Leu Glu 
3120 3125 3130 3135 

GCC AAC ATT AAA CGT GAC AAG TAT CAC CTG GTA GGG GAC ATA GCC ACT 9816 
Ala Asn He Lys Arg Asp Lys Tyr His Leu Val Gly Asp He Ala Thr 
3140 3145 3150 

ATT AAA GAA AAA GCC AAA CAA CTG GGG GCA ACA GAC TCC ACA AAG ATA 9864 
He Lys Glu Lys Ala Lys Gin Leu Gly Ala Thr Asp Ser Thr Lys He 
3155 3160 3165 

TCA AAG GAG GTT GGC GCG AAA GTG TAT TCT ATG AAG CTG AGT AAC TGG 9912 
Ser Lys Glu Val Gly Ala Lys Val Tyr Ser Met Lys Leu Ser Asn Trp 
3170 3175 3180 

GTG ATA CAA GAA GAG AAT AAA CAA GGC AGC CTT GCC CCC CTG TTT GAA 9960 
Val He Gin Glu Glu Asn Lys Gin Gly Ser Leu Ala Pro Leu Phe Glu 
3185 3190 3195 

GAG CTC CTG CAA CAG TGC CCA CCC GGG GGC CAG AAC AAA ACC ACA CAT 10008 
Glu Leu Leu Gin Gin Cys Pro Pro Gly Gly Gin Asn Lys Thr Thr His 
3200 3205 3210 3215 

ATG GTC TCA GCC TAC CAA CTA GCT CAA GGG AAT TGG GTG CCA GTT AGT 10056 
Met Val Ser Ala Tyr Gin Leu Ala Gin Gly Asn Trp Val Pro Val Ser 
3220 3225 3230 

TGC CAC GTG TTC ATG GGG ACC ATA CCC GCC AGA AGA ACC AAG ACT CAT 10104 
Cys His Val Phe Met Gly Thr lie Pro Ala Arg Arg Thr Lys Thr His 
3235 3240 3245 

CCT TAT GAG GCA TAC GTT AAG CTA AGG GAG TTG GTA GAT GAA CAT AAG 10152 
Pro Tyr Glu Ala Tyr Val Lys Leu Arg Glu Leu Val Asp Glu His Lys 
3250 3255 3260 

ATG AAG GCA TTA TGT GGC GGA TCA GGC CTA AGT AAG CAC AAC GAA TGG 10200 
Met Lys Ala Leu Cys Gly Gly Ser Gly Leu Ser Lys His Asn Glu Trp 
3265 3270 3275 

GTA ATT GGC AAG GTC AAG TAT CAA GGA AAC CTG AGG ACC AAA CAC ATG 10248 
Val He Gly Lys Val Lys Tyr Gin Gly Asn Leu Arg Thr Lys His Met 
3280 3285 3290 32 95 

TTG AAC CCC GGA AAG GTG GCG GAG CAA CTG CAC AGA GAA GGG TAC AGG 10296 
Leu Asn Pro Gly Lys Val Ala Glu Gin Leu His Arg Glu Gly Tyr Arg 
3300 3305 3310 

CAC AAT GTG TAT AAT AAG ACA ATA GGT TCA GTG ATG ACA GCA ACT GGT 10344 
His Asn Val Tyr Asn Lys Thr He Gly Ser Val Met Thr Ala Thr Gly 
3315 3320 3325 

ATC AGG CTG GAG AAG TTA CCT GTG GTT AGG GCC CAA ACA GAC ACA ACC 10392 
He Arg Leu Glu Lys Leu Pro Val Val Arg Ala Gin Thr Asp Thr Thr 
3330 3335 334Q 
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AAC TTC CAC CAA GCA ATA AGG GAT AAA ATA GAC AAG GAG GAG AAC CTA 10440 
Asn Phe His Gin Ala He Arg Asp Lys He Asp Lys Glu Glu Asn Leu 
3345 3350 3355 

CAG ACC CCT GGC TTG CAT AAG AAG TTA ATG GAA GTC TTC AAT GCA TTA 10488 
Gin Thr Pro Gly Leu His Lys Lys Leu Met Glu Val Phe Asn Ala Leu 
3360 3365 3370 3375 

AAA AGA CCC GAG CTT GAG GCC TCT TAT GAC GCT GTG GAT TGG GAG GAA 10536 
Lys Arg Pro Glu Leu Glu Ala Ser Tyr Asp Ala Val Asp Trp Glu Glu 
3380 3385 3390 

TTG GAG AGA GGA ATA AAT AGG AAG GGT GCT GCT GGT TTC TTC GAA CGC 10584 
Leu Glu Arg Gly He Asn Arg Lys Gly Ala Ala Gly Phe Phe Glu Arg 
3395 3400 3405 

AAG AAC ATA GGA GAG GTT TTG GAT TCG GAA AAA AAT AAA GTC GAA GAG 10632 
Lys Asn He Gly Glu Val Leu Asp Ser Glu Lys Asn Lys Val Glu Glu 
3410 3415 3420 

GTT ATT GAC AGT TTG AAA AAA GGT AGG AAT ATC AGA TAC TAC GAA ACT 10680 
Val He Asp Ser Leu Lys Lys Gly Arg Asn He Arg Tyr Tyr Glu Thr 
3425 3430 3435 

GCA ATC CCG AAA AAC GAG AAG AGG GAT GTC AAT GAT GAC TGG ACC GCT 10728 
Ala He Pro Lys Asn Glu Lys Arg Asp Val Asn Asp Asp Trp Thr Ala 
3440 3445 3450 3455 

GGT GAC TTC GTA GAT GAG AAG AAG CCA AGA GTG ATA CAA TAC CCT GAG 10776 
Gly Asp Phe Val Asp Glu Lys Lys Pro Arg Val He Gin Tyr Pro Glu 
3460 ?465 3470 

GCT AAA ACT AGG TTG GCT ATT ACT AAG GTA ATG TAC AAG TGG GTC AAA 10824 
Ala Lys Thr Arg Leu Ala He Thr Lys Val Met Tyr Lys Trp Val Lys 
3475 3480 3485 

CAG AAG CCA GTT GTC ATA CCG GGT TAT GAA GGT AAG ACA CCC CTG TTT 10872 
Gin Lys Pro Val Val He Pro Gly Tyr Glu Gly Lys Thr Pro Leu Phe 
3490 3495 3500 

CAA ATT TTT GAC AAA GTG AAG AAA GAA TGG GAT CAA TTC CAA AAC CCT 10920 
Gin lie Phe Asp Lys Val Lys Lys Glu Trp Asp Gin Phe Gin Asn Pro 
3505 3510 3515 

GTG GCA GTT AGC TTT GAT ACC AAA GCG TGG GAT ACC CAG GTA ACC ACA 10968 
Val Ala Val Ser Phe Asp Thr Lys Ala Trp Asp Thr Gin Val Thr Thr 
3520 3525 3530 3535 

AGG GAT TTG GAG CTA ATA AGG GAT ATA CAG AAG TTC TAT TTT AAA AAG 11016 
Arg Asp Leu Glu Leu He Arg Asp He Gin Lys Phe Tyr Phe Lys Lys 
3540 3545 3550 

AAA TGG CAC AAA TTC ATT GAC ACC CTA ACC AAG CAC ATG TCA GAA GTA 11064 
Lys Trp Hxs Lys Phe He Asp Thr Leu Thr Lys His Met Ser Glu Val 
3555 3560 3565 
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CCC GTA ATC AGT GCC GAC GGG GAG GTA TAC ATA AGG AAA GGT CAG AGA 11112 
Pro Val He Ser Ala Asp Gly Glu Val Tyr He Arg Lys Gly Gin Arg 
3570 3575 3580 



llil GGC AGT GGG CAA CCT GAC ACG AGC GCA GGC AAC AGC ATG TTG AAT GTG 11160 

; !!< " G1 Y Ser Gly Gin Pro Asp Thr Ser Ala Gly Asn Ser Met Leu Asn Val 

3585 3590 3595 

TTG ACA ATG GTG TAT GCC TTC TGC GAG GCC ACG GGG GTA CCC TAC AAG 11208 
Leu Thr Met Val Tyr Ala Phe Cys Glu Ala Thr Gly Val Pro Tyr Lys 
3600 3605 3610 3615 

AGT TTT GAC AGA GTG GCA AAG ATC CAT GTC TGC GGG GAT GAT GGT TTC 11256 
Ser Phe Asp Arg Val Ala Lys He His Val Cys Gly Asp Asp Gly Phe 
3620 3625 3630 



" ' CTG ATT ACC GAA AGA GCT GGT GAG AAA TTT GCG AGT AAA GGA GTC 11304 

ilPl Leu He Thr Glu Arg Ala Leu Gly Glu Lys Phe Ala Ser Lys Gly Val 

3635 3640. 3645 

CAG ATC CTA TAC GAA GCT GGG AAG CCT CAA AAG ATC ACT GAA GGG GAC 11352 
Gin He Leu Tyr Glu Ala Gly Lys Pro Gin Lys He Thr Glu Gly Asp 
3650 3655 3660 

AAG ATG AAA GTA GCC TAT CAG TTT GAT GAT ATC GAG TTC TGC TCC CAT 11400 
Lys Met Lys Val Ala Tyr Gin Phe Asp Asp He Glu Phe Cys Ser His 
3665 3670 3675 

ACA CCA GTA CAA GTG AGG TGG TCA GAC AAT ACT TCC AGC TAC ATG CCG 11448 
Thr Pro Val Gin Val Arg Trp Ser Asp Asn Thr Ser Ser Tyr Met Pro 
3680 3685 3690 3695 

GGA AGG AAC ACG ACT ACA ATC CTG GCT AAA ATG GCT ACA AGG TTG GAT 11496 
Gly Arg Asn Thr Thr Thr He Leu Ala Lys Met Ala Thr Arg Leu Asp 
3700 3705 3710 



WM ~ CC AGT GGT GAG AGG GGT ACT ATA GCA TAT GAG AAG GCA GTG GCG TTC 11544 

1 Ser Ser Gly Glu Arg Gly Thr He Ala Tyr Glu Lys Ala Val Ala Phe 

~ 3715 3720 3725 

AGC TTT TTG TTG ATG TAC TCC TGG AAC CCA CTG ATC AGA AGG ATA TGC 11592 
Ser Phe Leu Leu Met Tyr Ser Trp Asn Pro Leu He Arg Arg He Cys 
3730 3735 3740 

TTA CTG GTG TTG TCA ACT GAG TTG CAA GTG AGA CCA GGG AAG TCA ACC 11640 
Leu Leu Val Leu Ser Thr Glu Leu Gin Val Arg Pro Gly Lys Ser Thr 
3745 3750 3755 

ACC TAT TAC TAT GAA GGG GAC CCA ATA TCC GCT TAC AAG GAA GTC ATT 11688 
Thr Tyr Tyr Tyr Glu Gly Asp Pro He Ser Ala Tyr Lys Glu Val He 
JP 3760 3765 3770 3775 

;; :>! GGC CAC AAT CTC TTT GAC CTT AAA AGA ACA AGC TTC GAA AAG CTA GCA 11736 

Gly His Asn Leu Phe Asp Leu Lys Arg Thr Ser Phe Glu Lys Leu Ala 
3780 3785 3790 



AAG TTA AAT CTC AGC ATG TCC ACG CTC GGG GTG TGG ACT AGA CAC ACT 
Lys Leu Asn Leu Ser Met Ser Thr Leu Gly Val Trp Thr Arg His Thr 
3795 3800 3805 



11784 



AGC AAG AGA TTA CTA CAA GAT TGT GTC AAT GTT GGC ACC AAA GAG GGC 11832 
Ser Lys Arg Leu Leu Gin Asp Cys Val Asn Val Gly Thr Lys Glu Gly 
3810 3815 3820 

AAC TGG CTG GTC AAT GCA GAC AGA CTA GTG AGT AGT AAG ACA GGA AAC 11880 
Asn Trp Leu Val Asn Ala Asp Arg Leu Val Ser Ser Lys Thr Gly Asn 
3825 3830 3835 

AGG TAT ATA CCT GGA GAG GGC CAC ACC CTA CAA GGG AAA CAT TAT GAA 11928 
Arg Tyr He Pro Gly Glu Gly His Thr Leu Gin Gly Lys His Tyr Glu 
3840 3845 3850 3855 

GAA CTG ATA CTG GCA AGG AAA CCG ATC GGT AAC TTT GAA GGG ACC GAT 11976 
Glu Leu He Leu Ala Arg Lys Pro He Gly Asn Phe Glu Gly Thr Asp 
3860 3865 3870 

AGG TAT AAC TTG GGG CCA ATA GTC AAT GTA GTG TTG AGG AGA CTA AAA 12024 
Arg Tyr Asn Leu Gly Pro He Val Asn Val Val Leu Arg Arg Leu Lys 
3875 3880 3885 

ATT ATG ATG ATG GCC CTG ATA GGA AGG GGG GTG TGAGCATGGT TGGCCCTTGA 12077 
He Met Met Met Ala Leu He Gly Arg Gly Val 
3890 3895 

TCGGGCCCTA TCAGTAGAAC CCTATTGTAA ATAACATTAA CTTATTAATT ATTTAGATAC 12137 

TATTATTTAT TTATTTATTT ATTTATTGAA TGAGCAAGTA CTGGTACAAA CTACCTCATG 12197 

TTACCACACT ACACTCATTT TAACAGCACT TTAGCTGGAG GGAAAACCCT GACGTCCACA 12257 

GTTGGACTAA GGTAATTTCC TAACGGC 12284 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3898 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Glu Leu Asn His Phe Glu Leu Leu Tyr Lys Thr Ser Lys Gin Lys 
1 5 io 15 

Pro Val Gly Val Glu Glu Pro Val Tyr Asp Thr Ala Gly Arg Pro Leu 
20 25 30 
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Phe Gly Asn Pro Ser Glu Val His Pro Gin Ser Thr Leu Lys Leu Pro 
35 40 45 

His Asp Arg Gly Arg Gly Asp He Arg Thr Thr Leu Arg Asp Leu Pro 
50 55 60 

Arg Lys Gly Asp Cys Arg Ser Gly Asn His Leu Gly Pro Val Ser Glv 
65 70 75 80 

He Tyr He Lys Pro Gly Pro Val Tyr Tyr Gin Asp Tyr Thr Gly Pro 

85 90 95 

Val Tyr His Arg Ala Pro Leu Glu Phe Phe Asp Glu Ala Gin Phe Cys 
100 105 no 

Glu Val Thr Lys Arg He Gly Arg Val Thr Gly Ser Asp Gly Lys Leu 
115 120 125 

Tyr His He Tyr Val Cys Val Asp Gly Cys He Leu Leu Lys Leu Ala 
130 135 14Q 

Lys Arg Gly Thr Pro Arg Thr Leu Lys Trp He Arg Asn Phe Thr Asn 
145 150 155 leo 

Cys Pro Leu Trp Val Thr Ser Cys Ser Asp Asp Gly Ala Ser Gly Ser 
165 170 i 7 5 

Lys Asp Lys Lys Pro Asp Arg Met Asn Lys Gly Lys Leu Lys He Ala 
180 185 190 

Pro Arg Glu His Glu Lys Asp Ser Lys Thr Lys Pro Pro Asp Ala Thr 
195 200 205 

He Val Val Glu Gly Val Lys Tyr Gin He Lys Lys Lys Gly Lys Val 
210 215 220 

Lys Gly Lys Asn Thr Gin Asp Gly Leu Tyr His Asn Lys Asn Lys Pro 
225 230 235 240 

Pro Glu Ser Arg Lys Lys Leu Glu Lys Ala Leu Leu Ala Trp Ala Val 
245 250 255 

He Thr He Leu Leu Tyr Gin Pro Val Ala Ala Glu Asn He Thr Gin 
260 265 270 

Trp Asn Leu Ser Asp Asn Gly Thr Asn Gly He Gin Arg Ala Met Tvr 
27 5 280 285 

Leu Arg Gly Val Asn Arg Ser Leu His Gly He Trp Pro Glu Lys He 
290 295 300 

Cys Lys Gly Val Pro Thr His Leu Ala Thr Asp Thr Glu Leu Lys Glu 
305 310 315 3 20 

He Arg Gly Met Met Asp Ala Ser Glu Arg Thr Asn Tyr Thr Cys Cys 
325 330 335 



• o 
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Arg Leu Gin Arg His Glu Trp Asn Lys His Gly Trp Cys Asn Trp Tyr 
340 345 350 

Asn He Asp Pro Trp He Gin Leu Met Asn Arg Thr Gin Thr Asn Leu 
355 360 365 

Thr Glu Gly Pro Pro Asp Lys Glu Cys Ala Val Thr Cys Arg Tyr Asp 
370 375 380 

Lys Asn Thr Asp Val Asn Val Val Thr Gin Ala Arg Asn Arg Pro Thr 
385 390 395 400 

Thr Leu Thr Gly Cys Lys Lys Gly Lys Asn Phe Ser Phe Ala Gly Thr 
405 410 415 

Val He Glu Gly Pro Cys Asn Phe Asn Val Ser Val Glu Asp He Leu 
420 425 430 

Tyr Gly Asp His Glu Cys Gly Ser Leu Leu Gin Asp Thr Ala Leu Tyr 
435 440 445 

Leu Leu Asp Gly Met Thr Asn Thr He Glu Asn Ala Arg Gin Gly Ala 
450 455 460 

Ala Arg Val Thr Ser Trp Leu Gly Arg Gin Leu Ser Thr Ala Gly Lys 
465 470 475 480 

Lys Leu Glu Arg Arg Ser Lys Thr Trp Phe Gly Ala Tyr Ala Leu Ser 
485 490 495 

Pro Tyr Cys Asn Val Thr Arg Lys He Gly Tyr He Trp Tyr Thr Asn 
500 505 510 

Asn Cys Thr Pro Ala Cys Leu Pro Lys Asn Thr Lys He He Gly Pro 
515 520 525 

Gly Lys Phe Asp Thr Asn Ala Glu Asp Gly Lys He Leu His Glu Met 
530 535 540 

Gly Gly His Leu Ser Glu Phe Leu Leu Leu Ser Leu Val He Leu Ser 
545 550 555 560 

Asp Phe Ala Pro Glu Thr Ala Ser Thr Leu Tyr Leu He Leu His Tyr 
565 570 575 

Ala He Pro Gin Ser His Glu Glu Pro Glu Gly Cys Asp Thr Asn Gin 
580 585 590 

Leu Asn Leu Thr Val Lys Leu Arg Thr Glu Asp Val Val Pro Ser Ser 
595 600 605 

Val Trp Asn He Gly Lys Tyr Val Cys Val Arg Pro Asp Trp Trp Pro 
610 615 620 
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Tyr Glu Thr Lys Val Ala Leu Leu Phe Glu Glu Ala Gly Gin Val He 
625 630 635 640 

Lys Leu Val Leu Arg Ala Leu Arg Asp Leu Thr Arg Val Trp Asn Ser 
645 650 655 

Ala Ser Thr Thr Ala Phe Leu He Cys Leu He Lys Val Leu Arg Gly 
660 665 670 

Gin Val Val Gin Gly He He Trp Leu Leu Leu Val Thr Gly Ala Gin 
675 680 685 

Gly Arg Leu Ala Cys Lys Glu Asp Tyr Arg Tyr Ala He Ser Ser Thr 
690 695 700 

Asn Glu He Gly Leu Leu Gly Ala Glu Gly Leu Thr Thr Thr Trp Lys 
705 710 715 720 

Glu Tyr Ser His Gly Leu Gin Leu Asp Asp Gly Thr Val Lys Ala Val 
7 25 730 735 

Cys Thr Ala Gly Ser Phe Lys Val Thr Ala Leu Asn Val Val Ser Arg 
740 745 750 

Arg Tyr Leu Ala Ser Leu His Lys Arg Ala Leu Pro Thr Ser Val Thr 
755 760 765 

Phe Glu Leu Leu Phe Asp Gly Thr Asn Pro Ala He Glu Glu Met Asp 
770 775 780 

Asp Asp Phe Gly Phe Gly Leu Cys Pro Phe Asp Thr Ser Pro Val He 
785 79 <> 795 800 

Lys Gly Lys Tyr Asn Thr Thr Leu Leu Asn Gly Ser Ala Phe Tyr Leu 
8 °5 810 815 

Val Cys Pro He Gly Trp Thr Gly Val Val Glu Cys Thr Ala Val Ser 
820 825 830 

Pro Thr Thr Leu Arg Thr Glu Val Val Lys Thr Phe Arg Arg Asp Lys 
835 840 845 

Pro Phe Pro His Arg Val Asp Cys Val Thr Thr He Val Glu Lys Glu 
850 S55 860 

Asp Leu Phe His Cys Lys Leu Gly Gly Asn Trp Thr Cys Val Lys Gly 
865 870 S75 88 o 

Asp Pro Val Thr Tyr Lys Gly Gly Gin Val Lys Gin Cys Arg Trp Cys 
885 890 895 

Gly Phe Glu Phe Lys Glu Pro Tyr Gly Leu Pro His Tyr Pro He Gly 
900 905 9io 

Lys Cys He Leu Thr Asn Glu Thr Gly Tyr Arg Val Val Asp Ser Thr 
915 920 925 
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Asp Cys Asn Arg Asp Gly Val Val lie Ser Thr Glu Gly Glu His Glu 
930 935 940 

Cys Leu He Gly Asn Thr Thr Val Lys Val His Ala Leu Asp Glu Arq 
945 950 955 960 

Leu Gly Pro Met Pro Cys Arg Pro Lys Glu He Val Ser Ser Glu Gly 
965 970 975 

Pro Val Arg Lys Thr Ser Cys Thr Phe Asn Tyr Thr Lys Thr Leu Arq 
980 985 990 

Asn Lys Tyr Tyr Glu Pro Arg Asp Ser Tyr Phe Gin Gin Tyr Met Leu 
995 1000 1005 

Lys Gly Glu Tyr Gin Tyr Trp Phe Asn Leu Asp Val Thr Asp His His 
1010 1015 1020 

Thr Asp Tyr Phe Ala Glu Phe Val Val Leu Val Val Val Ala Leu Leu 
1025 1030 1035 1040 

Gly Gly Arg Tyr Val Leu Trp Leu He Val Thr Tyr He He Leu Thr 
1045 1050 1055 

Glu Gin Leu Ala Ala Gly Leu Gin Leu Gly Gin Gly Glu Val Val Leu 
1060 1065 1070 

He Gly Asn Leu He Thr His Thr Asp Asn Glu Val Val Val Tyr Phe 
1075 1080 1085 

Leu Leu Leu Tyr Leu Val He Arg Asp Glu Pro He Lys Lys Trp He 
1°90 1095 iioo 

Leu Leu Leu Phe His Ala Met Thr Asn Asn Pro Val Lys Thr He Thr 
H05 mo ins i 120 

Val Ala Leu Leu Met He Ser Gly Val Ala Lys Gly Gly Lys He Asp 
1125 H30 i 135 

Gly Gly Trp Gin Arg Gin Pro Val Thr Ser Phe Asp He Gin Leu Ala 
114° 1145 H50 

Leu. Ala Val Val Val Val Val Val Met Leu Leu Ala Lys Arg Asp Pro 
1155 H60 H65 

Thr Thr Phe Pro Leu Val He Thr Val Ala Thr Leu Arg Thr Ala Lys 
1170 1175 H80 

He Thr Asn Gly Phe Ser Thr Asp Leu Val He Ala Thr Val Ser Ala 
1185 H90 1195 1200 

Ala Leu Leu Thr Trp Thr Tyr He Ser Asp Tyr Tyr Lys Tyr Lvs Thr 
1205 1210 1215 
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Trp Leu Gin Tyr Leu Val Ser Thr Val Thr Gly He Phe Leu He Arg 
1220 1225 1230 

Val Leu Lys Gly He Gly Glu Leu Asp Leu His Ala Pro Thr Leu Pro 
1235 1240 1245 

Ser His Arg Pro Leu Phe Tyr He Leu Val Tyr Leu He Ser Thr Ala 
1250 1255 1260 

Val Val Thr Arg Trp Asn Leu Asp Val Ala Gly Leu Leu Leu Gin Cys 
12 65 1270 1275 1280 

Val Pro Thr Leu Leu Met Val Phe Thr Met Trp Ala Asp He Leu Thr 
1285 1290 1295 

Leu He Leu He Leu Pro Thr Tyr Glu Leu Thr Lys Leu Tyr Tyr Leu 
1300 1305 1310 

Lys Glu Val Lys He Gly Ala Glu Arg Gly Trp Leu Trp Lys Thr Asn 
1315 1320 1325 

Tyr Lys Arg Val Asn Asp He Tyr Glu Val Asp Gin Thr Ser Glu Gly 
1330 1335 1340 

Val Tyr Leu Phe Pro Ser Lys Gin Arg Thr Ser Ala He Thr Ser Thr 
1345 1350 1355 1360 

Met Leu Pro Leu He Lys Ala He Leu He Ser Cys He Ser Asn Lys 
1365 1370 1375 

Trp Gin Leu He Tyr Leu Leu Tyr Leu He Phe Glu Val Ser Tyr Tyr 
1380 1385 1390 

Leu His Lys Lys Val He Asp Glu He Ala Gly Gly Thr Asn Phe Val 
1395 1400 1405 

Ser Arg Leu Val Ala Ala Leu He Glu Val Asn Trp Ala Phe Asp Asn 
1410 1415 1420 

Glu Glu Val Lys Gly Leu Lys Lys Phe Phe Leu Leu Ser Ser Arg Val 
1425 1430 1435 1440 

Lys Glu Leu He He Lys His Lys Val Arg Asn Glu Val Val Val Arg 
1445 1450 1455 

Trp Phe Gly Asp Glu Glu He Tyr Gly Met Pro Lys Leu He Gly Leu 
1460 1465 1470 

Val Lys Ala Ala Thr Leu Ser Arg Asn Lys His Cys Met Leu Cys Thr 
1475 1480 1485 

Val Cys Glu Asp Arg Asp Trp Arg Gly Glu Thr Cys Pro Lys Cys Gly 
1490 1495 1500 

Arg Phe Gly Pro Pro Val Val Cys Gly Met Thr Leu Ala Asp Phe Glu 
1 5 °5 1510 1515 1520 



55 



Glu Lys His Tyr Lys Arg lie Phe lie Arg Glu Asp Gin Ser Gly Gly 
1525 1530 1535 

Pro Leu Arg Glu Glu His Ala Gly Tyr Leu Gin Tyr Lys Ala Arg Gly 
1540 1545 1550 

Gin Leu Phe Leu Arg Asn Leu Pro Val Leu Ala Thr Lys Val Lys Met 
1555 1560 1565 

Leu Leu Val Gly Asn Leu Gly Thr Glu lie Gly Asp Leu Glu His Leu 
1570 1575 1580 

Gly Trp Val Leu Arg Gly Pro Ala Val Cys Lys Lys Val Thr Glu His 
1585 1590 1595 1600 

Glu Arg Cys Thr Thr Ser lie Met Asp Lys Leu Thr Ala Phe Phe Gly 
1605 1610 1615 

Val Met Pro Arg Gly Thr Thr Pro Arg Ala Pro Val Arg Phe Pro Thr 
1620 1625 1630 

Ser Leu Leu Lys lie Arg Arg Gly Leu Glu Thr Gly Trp Ala Tyr Thr 
1635 1640 1645 

His Gin Gly Gly lie Ser Ser Val Asp His Val Thr Cys Gly Lys Asp 
1650 1655 1660 

Leu Leu Val Cys Asp Thr Met Gly Arg Thr Arg Val Val Cys Gin Ser 
1665 1670 1675 1680 

Asn Asn Lys Met Thr Asp Glu Ser Glu Tyr Gly Val Lys Thr Asp Ser 
1685 1690 1695 

Gly Cys Pro Glu Gly Ala Arg Cys Tyr Val Phe Asn Pro Glu Ala Val 
1700 1705 1710 

Asn lie Ser Gly Thr Lys Gly Ala Met Val His Leu Gin Lys Thr Gly 
1715 1720 1725 

Gly Glu Phe Thr Cys Val Thr Ala Ser Gly Thr Pro Ala Phe Phe Asp 
1730 1735 1740 

Leu Lys Asn Leu Lys Gly Trp Ser Gly Leu Pro lie Phe Glu Ala Ser 
1745 1750 1755 1760 

Ser Gly Arg Val Val Gly Arg Val Lys Val Gly Lys Asn Glu Asp Ser 
1765 1770 1775 

Lys Pro Thr Lys Leu Met Ser Gly lie Gin Thr Val Ser Lys Ser Ala 
1780 1785 1790 

Thr Asp Leu Thr Glu Met Val Lys Lys lie Thr Thr Met Asn Arg Gly 
1795 1800 1805 



Glu Phe Arg Gin lie Thr Leu Ala Thr Gly Ala Gly Lys Thr Thr Glu 
1810 1815 1820 

Leu Pro Arg Ser Val lie Glu Glu lie Gly Arg His Lys Arg Val Leu 
1825 1830 1835 1840 

Val Leu lie Pro Leu Arg Ala Ala Ala Glu Ser Val Tyr Gin Tyr Met 
1845 1850 1855 

Arg Gin Lys His Pro Ser lie Ala Phe Asn Leu Arg lie Gly Glu Met 
1860 1865 1870 

Lys Glu Gly Asp Met Ala Thr Gly lie Thr Tyr Ala Ser Tyr Gly Tyr 
1875 1880 1885 

Phe Cys Gin Met Ser Gin Pro Lys Leu Arg Ala Ala Met Val Glu Tyr 
1890 1895 1900 

Ser Phe lie Phe Leu Asp Glu Tyr His Cys Ala Thr Pro Glu Gin Leu 
1905 1910 1915 1920 

Ala lie Met Gly Lys lie His Arg Phe Ser Glu Asn Leu Arg Val Val 
1925 1930 1935 

Ala Met Thr Ala Thr Pro Ala Gly Thr Val Thr Thr Thr Gly Gin Lys 
1940 1945 1950 

His Pro lie Glu Glu Phe lie Ala Pro Glu Val Met Lys Gly Glu Asp 
1955 1960 1965 

Leu Gly Ser Glu Tyr Leu Asp lie Ala Gly Leu Lys lie Pro Val Glu 
1970 1975 1980 

Glu Met Lys Asn Asn Met Leu Val Phe Val Pro Thr Arg Asn Met Ala 
1985 1990 1995 2000 

Val Glu Ala Ala T.ys Lys Leu Lys Ala Lys Gly Tyr Asn Ser Gly Tyr 
2005 2010 2015 

Tyr Tyr Ser Gly Glu Asp Pro Ser Asn Leu Arg Val Val Thr Ser Gin 
2020 2025 2030 

Ser Pro Tyr Val Val Val Ala Thr Asn Ala lie Glu Ser Gly Val Thr 
2035 2040 2045 

Leu Pro Asp Leu Asp Val Val Val Asp Thr Gly Leu Lys Cys Glu Lys 
2050 2055 2060 

Arg lie Arg Leu Ser Pro Lys Met Pro Phe lie Val Thr Gly Leu Lys 
2065 2070 2075 2080 

Arg Met Ala Val Thr lie Gly Glu Gin Ala Gin Arg Arg Gly Arg Val 
2085 2090 2095 

Gly Arg Val Lys Pro Gly Arg Tyr Tyr Arg Ser Gin Glu Thr Pro Val 
2100 2105 2110 
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Gly Ser Lys Asp Tyr His Tyr Asp Leu Leu Gin Ala Gin Arg Tyr Gly 
2115 2120 2125 

lie Glu Asp Gly lie Asn lie Thr Lys Ser Phe Arg Glu Met Asn Tyr 
2130 2135 2140 

Asp Trp Ser Leu Tyr Glu Glu Asp Ser Leu Met lie Thr Gin Leu Glu 
2145 2150 2155 2160 

lie Leu Asn Asn Leu Leu lie Ser Glu Glu Leu Pro Met Ala Val Lys 
2165 2170 2175 

Asn lie Met Ala Arg Thr Asp His Pro Glu Pro lie Gin Leu Ala Tyr 
2180 2185 2190 

Asn Ser Tyr Glu Thr Gin Val Pro Val Leu Phe Pro Lys He Arg Asn 
2195 2200 2205 

Gly Glu Val Thr Asp Thr Tyr Asp Asn Tyr Thr Phe Leu Asn Ala Arg 
2210 2215 2220 

Lys Leu Gly Asp Asp Val Pro Pro Tyr Val Tyr Ala Thr Glu Asp Glu 
2225 2230 2235 2240 

Asp Leu Ala Val Glu Leu Leu Gly Leu Asp Trp Pro Asp Pro Gly Asn 
2245 2250 2255 

Gin Gly Thr Val Glu Ala Gly Arg Ala Leu Lys Gin Val Val Gly Leu 
2260 2265 2270 

Ser Thr Ala Glu Asn Ala Leu Leu Val Ala Leu Phe Gly Tyr Val Gly 
2275 2280 2285 

Tyr Gin Ala Leu Ser Lys Arg His He Pro Val Val Thr Asp He Tyr 
2290 2295 2300 

Ser Val Glu Asp His Arg Leu Glu Asp Thr Thr His Leu Gin Tyr Ala 
2305 2310 2315 2320 

Pro Asn Ala He Lys Thr Glu Gly Lys Glu Thr Glu Leu Lys Glu Leu 
2325 2330 2335 

Ala Gin Gly Asp Val Gin Arg Cys Val Glu Ala Val Thr Asn Tyr Ala 
2340 2345 2350 

Arg Glu Gly He Gin Phe Met Lys Ser Gin Ala Leu Lys Val Arg Glu 
2355 2360 2365 

Thr Pro Thr Tyr Lys Glu Thr Met Asn Thr Val Ala Asp Tyr Val Lys 
2370 2375 2380 

Lys Phe He Glu Ala Leu Thr Asp Ser Lys Glu Asp He He Lys Tyr 
2385 2390 2395 2400 
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Gly Leu Trp Gly Ala His Thr Ala Leu Tyr Lys Ser He Gly Ala Arg 
2405 2410 2415 

Leu Gly His Glu Thr Ala Phe Ala Thr Leu Val Val Lys Trp Leu Ala 
2420 2425 2430 

Phe Gly Gly Glu Ser He Ser Asp His He Lys Gin Ala Ala Thr Asp 
2435 2440 2445 

Leu Val Val Tyr Tyr He He Asn Arg Pro Gin Phe Pro Gly Asp Thr 
2450 2455 2460 

Glu Thr Gin Gin Glu Gly Arg Lys Phe Val Ala Ser Leu Leu Val Ser 
2465 2470 2475 2480 

Ala Leu Ala Thr Tyr Thr Tyr Lys Ser Trp Asn Tyr Asn Asn Leu Ser 
2485 2490 2495 

Lys He Val Glu Pro Ala Leu Ala Thr Leu Pro Tyr Ala Ala Lys Ala 
2500 2505 2510 

Leu Lys Leu Phe Ala Pro Thr Arg Leu Glu Ser Val Val He Leu Ser 
2515 2520 2525 

Thr Ala lie Tyr Lys Thr Tyr Leu Ser He Arg Arg Gly Lys Ser Asp 
2530 2535 2540 

Gly Leu Leu Gly Thr Gly Val Ser Ala Ala Met Glu He Met Ser Gin 
254 5 2550 2555 2560 

Asn Pro Val Ser Val Gly He Ala Val Met Leu Gly Val Gly Ala Val 
2565 2570 2575 

Ala Ala His Asn Ala He Glu Ala Ser Glu Gin Lys Arg Thr Leu Leu 
2580 2585 2590 

Met Lys Val Phe Val Lys Asn Phe Leu Asp Gin Ala Ala Thr Asp Glu 
2595 2600 2605 

Leu Val Lys Glu Ser Pro Glu Lys He He Met Ala Leu Phe Glu Ala 
2610 2615 2620 

Val Gin Thr Val Gly Asn Pro Leu Arg Leu Val Tyr His Leu Tyr Gly 
2625 2630 2635 2640 

Val Phe Tyr Lys Gly Trp Glu Ala Lys Glu Leu Ala Gin Arg Thr Ala 
2645 2650 2655 

Gly Arg Asn Leu Phe Thr Leu He Met Phe Glu Ala Val Glu Leu Leu 
2 660 2665 2670 

Gly Val Asp Ser Glu Gly Lys He Arg Gin Leu Ser Ser Asn Tyr He 
2675 2680 2685 

Leu Glu Leu Leu Tyr Lys Phe Arg Asp Asn He Lys Ser Ser Val Arg 
2690 2695 2700 
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Glu lie Ala lie Ser Trp Ala Pro Ala Pro Phe Ser Cys Asp Trp Thr 
2705 2710 2715 2720 

Pro Thr Asp Asp Arg lie Gly Leu Pro His Asp Asn Tyr Leu Arg Val 
2725 2730 2735 



Glu Thr Lys Cys Pro Cys Gly Tyr Arg Met Lys Ala Val Lys Asn Cys 
2740 2745 2750 

Ala Gly Glu Leu Arg Leu Leu Glu Glu Gly Gly Ser Phe Leu Cys Arg 
2755 2760 2765 

Asn Lys Phe Gly Arg Gly Ser Gin Asn Tyr Arg Val Thr Lys Tyr Tyr 
2770 2775 2780 

Asp Asp Asn Leu Ser Glu lie Lys Pro Val lie Arg Met Glu Gly His 
2785 2790 2795 2800 

Val Glu Leu Tyr Tyr Lys Gly Ala Thr He Lys Leu Asp Phe Asn Asn 
2805 2810 2815 

Ser Lys Thr Val Leu Ala Thr Asp Lys Trp Glu Val Asp His Ser Thr 
2820 2825 2830 



Leu Val Arg Ala Leu Lys Arg Tyr Thr Gly Ala Gly Tyr Arg Gly Ala 
2835 2840 2845 

Tyr Leu Gly Glu Lys Pro Asn His Lys His Leu He Gin Arg Asp Cys 
2850 2855 2860 
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Ala Thr He Thr Lys Asp Lys Val Cys Phe He Lys Met Lys Arg Gly 
2865 2870 2875 2880 

Cys Ala Phe Thr Tyr Asp Leu Ser Leu His Asn Leu Thr Arg Leu He 
2885 2890 2895 

Glu Leu Val His Lys Asn Asn Leu Glu Asp Arg Glu He Pro Ala Val 
2900 2905 2910 



Thr Val Thr Thr Trp Leu Ala Tyr Thr Phe Val Asn Glu Asp He Gly 
2915 2920 2925 

Thr He Lys Pro Thr Phe Gly Glu Lys Val Thr Pro Glu Lys Gin Glu 
2930 2935 2940 

Glu Val Val Leu Gin Pro Ala Val Val Val Asp Thr Thr Asp Val Ala 
2^45 2950 2955 2960 



Val Thr Val Val Gly Glu Thr Ser Thr Met Thr Thr Gly Glu Thr Pro 
2965 2970 2975 

Thr Thr Phe Thr Ser Leu Gly Ser Asp Ser Lys Val Arg Gin Val Leu 
2980 2985 2990 
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Lys Leu Gly Val Asp Asp Gly Gin Tyr Pro Gly Pro Asn Gin Gin Arg 
2995 3000 3005 

Ala Ser Leu Leu Glu Ala lie Gin Gly Val Asp Glu Arg Pro Ser Val 
3010 3015 3020 

Leu lie Leu Gly Ser Asp Lys Ala Thr Ser Asn Arg Val Lys Thr Ala 
3025 3030 3035 3040 

Lys Asn Val Lys lie Tyr Arg Ser Arg Asp Pro Leu Glu Leu Arg Glu 
3045 3050 3055 

Met Met Lys Arg Gly Lys lie Leu Val Val Ala Leu Ser Arg Val Asp 
3060 3065 3070 

Thr Ala Leu Leu Lys Phe Val Asp Tyr Lys Gly Thr Phe Leu Thr Arg 
3075 3080 3085 

Glu Thr Leu Glu Ala Leu Ser Leu Gly Lys Pro Lys Lys Arg Asp lie 
3090 3095 3100 

Thr Lys Ala Glu Ala Gin Trp Leu Leu Arg Leu Glu Asp Gin lie Glu 
3105 3110 3115 3120 

Glu Leu Pro Asp Trp Phe Ala Ala Lys Glu Pro lie Phe Leu Glu Ala 
3125 3130 3135 

Asn He Lys Arg Asp Lys Tyr His Leu Val Gly Asp He Ala Thr He 
3140 3145 L 3150 

Lys Glu Lys Ala Lys Gin Leu Gly Ala Thr Asp Ser Thr Lys He Ser 
3155 3160 3165 

Lys Glu Val Gly Ala Lys Val Tyr Ser Met Lys Leu Ser Asn Trp Val 
3170 3175 3180 

He Gin Glu Glu Asn Lys Gin Gly Scr Leu Ala Pro Leu Phe Glu Glu 
3185 3190 3195 3200 

Leu Leu Gin Gin Cys Pro Pro Cly Gly Gin Asn Lys Thr Thr His Met 
3205 3210 3215 

Val Ser Ala Tyr Gin Leu Ala Gin Gly Asn Trp Val Pro Val Ser Cys 
3220 3225 3230 

His Val Phe Met Gly Thr He Pro Ala Arg Arg Thr Lys Thr His Pro 
3235 3240 . 3245 

Tyr Glu Ala Tyr Val Lys Leu Arg Glu Leu Val Asp Glu His Lys Met 
3250 3255 3260 

Lys Ala Leu Cys Gly Gly Ser Gly Leu Ser Lys His Asn Glu Trp Val 
3265 3270 3275 3280 

He Gly Lys Val Lys Tyr Gin Gly Asn Leu Arg Thr Lys His Met Leu 
3285 3290 3295 
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Asn Pro Gly Lys Val Ala Glu Gin Leu His Arg Glu Gly Tyr Arg His 
3300 3305 3310 

Asn Val Tyr Asn Lys Thr lie Gly Ser Val Met Thr Ala Thr Gly lie 
3315 3320 3325 

Arg Leu Glu Lys Leu Pro Val Val Arg Ala Gin Thr Asp Thr Thr Asn 
3330 3335 3340 

Phe His Gin Ala lie Arg Asp Lys lie Asp Lys Glu Glu Asn Leu Gin 
3345 3350 3355 3360 

Thr Pro Gly Leu His Lys Lys Leu Met Glu Val Phe Asn Ala Leu Lys 
3365 3370 3375 

Arg Pro Glu Leu Glu Ala Ser Tyr Asp Ala Val Asp Trp Glu Glu Leu 
3380 3385 3390 

Glu Arg Gly lie Asn Arg Lys Gly Ala Ala Gly Phe Phe Glu Arg Lys 
3395 3400 3405 

Asn lie Gly Glu Val Leu Asp Ser Glu Lys Asn Lys Val Glu Glu Val 
3410 3415 3420 

lie Asp Ser Leu Lys Lys Gly Arg Asn lie Arg Tyr Tyr Glu Thr Ala 
3425 3430 3435 3440 

lie Pro Lys Asn Glu Lys Arg Asp Val Asn Asp Asp Trp Thr Ala Gly 
3445 3450 3455 

Asp Phe Val Asp Glu Lys Lys Pro Arg Val lie Gin Tyr Pro Glu Ala 
3460 3465 3470 

Lys Thr Arg Leu Ala lie Thr Lys Val Met Tyr Lys Trp Val Lys Gin 
3475 3480 3485 

Lys Pro Val Val lie Pro Gly Tyr Glu Gly Lys Thr Pro Leu Phe Gin 
3490 3495 3500 

lie Phe Asp Lys Val Lys Lys Glu Trp Asp Gin Phe Gin Asn Pro Val 
3505 3510 3515 3520 

Ale* Val Ser Phe Asp Thr Lys Ala Trp Asp Thr Gin Val Thr Thr Arg 
3525 3530 3535 

Asp Leu Glu Leu lie Arg Asp lie Gin Lys Phe Tyr Phe Lys Lys Lys 
3540 3545 3550 

Trp His Lys Phe lie Asp Thr Leu Thr Lys His Met Ser Glu Val Pro 
3555 3560 3565 

Val lie Ser Ala Asp Gly Glu Val Tyr lie Arg Lys Gly Gin Arg Gly 
3570 3575 3580 
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Ser Gly Gin Pro Asp Thr Ser Ala Gly Asn Ser Met Leu Asn Val Leu 
3585 3590 3595 3600 

Thr Met Val Tyr Ala Phe Cys Glu Ala Thr Gly Val Pro Tyr Lys Ser 
3605 3610 3615 

Phe Asp Arg Val Ala Lys lie His Val Cys Gly Asp Asp Gly Phe Leu 
3620 3625 3630 

He Thr Glu Arg Ala Leu Gly Glu Lys Phe Ala Ser Lys Gly Val Gin 
3635 3640 3645 

He Leu Tyr Glu Ala Gly Lys Pro Gin Lys He Thr Glu Gly Asp Lys 
3650 3655 3660 

Met Lys Val Ala Tyr Gin Phe Asp Asp He Glu Phe Cys Ser His Thr 
3665 3670 3675 3680 

Pro Val Gin Val Arg Trp Ser Asp Asn Thr Ser Ser Tyr Met Pro Gly 
3685 3690 3695 

Arg Asn Thr Thr Thr He Leu Ala Lys Met Ala Thr Arg Leu Asp Ser 
3700 3705 3710 

Ser Gly Glu Arg Gly Thr He Ala Tyr Glu Lys Ala Val Ala Phe Ser 
3715 3720 3725 

Phe Leu Leu Met Tyr Ser Trp Asn Pro Leu He Arg Arg He Cys Leu 
3730 3735 3740 

Leu Val Leu Ser Thr Glu Leu Gin Val Arg Pro Gly Lys Ser Thr Thr 
3745 3750 3755 3760 

Tyr Tyr Tyr Glu Gly Asp Pro He Ser Ala Tyr Lys Glu Val He Gly 
3765 3770 3775 

His Asn Leu Phe Asp Leu Lys Arg Thr Ser Phe Glu Lys Leu Ala Lys 
3780 3785 3790 

Leu Asn Leu Ser Met Ser Thr Leu Gly Val Trp Thr Arg His Thr Ser 
3795 3800 3805 

Lys Arg Leu Leu Gin Asp Cys Val Asn Val Gly Thr Lys Glu Gly Asn 
- 3810 3815 3820 

Trp Leu Val Asn Ala Asp Arg Leu Val Ser Ser Lys Thr Gly Asn Arg 
3825 3830 3835 3840 

Tyr He Pro Gly Glu Gly His Thr Leu Gin Gly Lys His Tyr Glu Glu 
3845 3850 3855 

Leu He Leu Ala Arg Lys Pro He Gly Asn Phe Glu Gly Thr Asp Arg 
3860 3865 3870 

Tyr Asn Leu Gly Pro He Val Asn Val Val Leu Arg Arg Leu Lys He 
3875 3880 3885 
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Met Met Met Ala Leu lie Gly Arg Gly Val 
3890 3895 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..33 

(D) OTHER INFORMATION: /label= primer_l 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
CCTACTAACC ACGTTAAGTG CTGTGACTTT AAA 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..39 

(D) OTHER INFORMATION: /label= primer_2 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
TTCTGTTCTC AAGGTTGTGG GGCTCACTGC TGTGCACTC 




64 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..16 

(D) OTHER INFORMATION: /label= Adaptor_l 



/note= "Upper strand of Bam HI - Hinf I adaptor, 
containing ATG at 364-366" 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GATCCACCAT GGAGTT 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..16 

(D) OTHER INFORMATION: /label= Adaptor_2 



/note= "Lower strand of Bam HI - Hinf I adaptor, 
containing ATG at 364-366" 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



GTGGTACCTC AACTTA 



16 



(2) INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..10 

(D) OTHER INFORMATION: /label= Adaptor_3 

/note= "Double stranded Stu I - Eco RI blunt 
adaptor" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GCCTGAATTC 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..21 

(D) OTHER INFORMATION: /label= Adaptor_4 

/note= "Upper strand of Bgl II - BamH I adaptor" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



GATCCACCAT GGGGGCCCTG T 
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(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..14 

(D) OTHER INFORMATION: /label= Adaptor_5 

/note= "Lower strand of Bgl II - BamH I adaptor" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GTGGTACCCC CGGG 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..15 

(D) OTHER INFORMATION: /label= Adaptor_6 

/note= "Upper strand of Ban I - Eco R I adaptor" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



GTGCCTATGC CTGAG 



• o • 
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(2) INFORMATION FOR SEQ ID NO: 11: 

n^s: ^ (i) SEQUENCE CHARACTERISTICS: 

iplp WmM (A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..15 

(D) OTHER INFORMATION: /label= Adaptor_7 

/note= "Lower strand of Ban I - Eco R I adaptor" 

( xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GATACGGACT CTTAA 15 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: lambda gtll clone 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..300 

(D) OTHER INFORMATION: /note= "Part of 0.8 kb insert of 
Lambda gtll" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

AGT GAC AAC GGC ACT AAT GGT ATT CAG CGA GCC ATG TAT CTT AGA GGG 48 
Ser Asp Asn Gly Thr Asn Gly He Gin Arg Ala Met Tyr Leu Arg Gly 
1 5 io 15 

GTT AAC AGG AGC TTA CAT GGG ATC TGG CCC GAG AAA ATA TGC AAG GGG 96 
Val Asn Arg Ser Leu His Gly He Trp Pro Glu Lys He Cys Lys Gly 
20 25 30 

GTC CCC ACT CAT CTG GCC ACT GAC ACG GAA CTG AAA GAG ATA CGC GGG 144 
Val Pro Thr His Leu Ala Thr Asp Thr Glu Leu Lys Glu He Arq Glv 
35 40 45 
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ATG ATG GAT GCC AGC GAG AGG ACA AAC TAT ACG TGC TGT AGG TTA CAA 192 
Met Met Asp Ala Ser Glu Arg Thr Asn Tyr Thr Cys Cys Arg Leu Gin 
50 55 60 

AGA CAT GAA TGG AAC AAA CAT GGA TGG TGT AAC TGG TAC AAC ATA GAC 240 
Arg His Glu Trp Asn Lys His Gly Trp Cys Asn Trp Tyr Asn He Asp 
65 70 75 80 

CCT TGG ATT CAG TTA ATG AAC AGG ACC CAA ACA AAT TTG ACA GAA GGC 288 
Pro Trp He Gin Leu Met Asn Arg Thr Gin Thr Asn Leu Thr Glu Gly 

85 90 95 



CCT CCA GAT AAG 
Pro Pro Asp Lys 
100 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

u \ 1 Ser As P As n Gly Thr Asn Gly He Gin Arg Ala Met Tyr Leu Arg Gly 

; J I 1 5 io is 

0 \ Val Asn A **g Ser Leu His Gly He Trp Pro Glu Lys He Cys Lys Glv 

J= 20 25 30 

famd ' Val Pro Thr His Ijeu Ala Thr Asp Thr Glu Leu Lys Glu He Ara Glv 

35 40 45 

Met Met Asp Ala Ser Glu Arg Thr Asn Tyr Thr Cys Cys Arg Leu Gin 
50 55 60 

Arg His Glu Trp Asn Lys His Gly Trp Cys Asn Trp Tyr Asn He Asp 
65 70 75 80 

Pro Trp He Gin Leu Met Asn Arg Thr Gin Thr Asn Leu Thr Glu Gly 

85 90 95 

Pro Pro Asp Lys 
100 
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